Cholesterol oxidase from Brevibacterium sterolicum

ABSTRACT

The invention concerns a cholesterol oxidase, a process for the production of a recombinant cholesterol oxidase, a DNA sequence suitable for this process which causes a cytoplasmic expression of the recombinant cholesterol oxidase in a host bacterium as well as the recombinant cholesterol oxidase obtained in this way.

The invention concerns a cholesterol oxidase from Brevibacterium sterolicum, a process for the production of a recombinant cholesterol oxidase from Brevibacterium sterolicum, a suitable DNA sequence for this process which results in a cytoplasmic expression of the recombinant cholesterol oxidase in the host bacterium as well as the recombinant cholesterol oxidase obtained in this manner.

Cholesterol oxidase is of major importance for the enzymatic determination of cholesterol. It catalyzes the oxidation of cholesterol to cholesten-3-one and H₂O₂. Cholesterol oxidases from various organisms such as Pseudomonas, Mycobacterium, Nocardia, Arthrobacter and Brevibacterium have already been described (T. Uwajima et al., Agr. Biol. Chem. 37 (1973), 2345-2350). All these known cholesterol oxidases are secreted proteins. The soil bacterium Brevibacterium sterolicum KY 3643 (ATCC 21387) has a particularly high activity of cholesterol oxidase. Three isoenzymes of cholesterol oxidase are known from this bacterium which differ in their isoelectric point, substrate specificity towards various steroids, affinity for cholesterol at the pH optimum and in their DNA and amino acid sequence (EP-A 0 452 112 and EP-A 560 983). Cholesterol oxidase I from Brevibacterium sterolicum has a low affinity for cholesterol (K_(M) 1.1×10⁻³ mol/l) and can only be obtained in a low yield from Brevibacterium sterolicum. Attempts have been made to express a complete DNA coding for cholesterol oxidase I in E. coli, but these have not yet succeeded (K. Fujishiro et al., Biochem. Biophys. Res. Com. 172 (1990), 721-727, T. Ohta et al., Gene 103 (1991), 93-96). The expression of special deletion mutants of the DNA coding for cholesterol oxidase I which were fused with parts of the lacZ gene also did not lead to a satisfactory expression in E. coli (T. Ohta et al., Biosci. Biotech. Biochem. 56 (1992), 1786-1791). The cloning and expression of further cholesterol oxidases from Brevibacterium sterolicum is described in EP-A 0 452 112. However, expression of these DNAs likewise does not lead to an adequate amount of active cholesterol oxidase.

The object of the invention was to provide a cholesterol oxidase with a high affinity for cholesterol in large amounts and in an active form.

This object is achieved by a cholesterol oxidase which has the amino acid sequence shown in SEQ ID NO 2. This cholesterol oxidase is obtainable from Brevibacterium sterolicum or can also be produced by recombinant means.

It has surprisingly turned out that such a cholesterol oxidase can be produced recombinantly in a large amount and in an active form. This cholesterol oxidase has a molecular weight of 60 kD, an isoelectric point of ca. 5.5 (each measured in the Phast System, Pharmacia LKB) and a K_(M) value for cholesterol of 1×10⁻⁴ mol/l (in 0.5 mol/l potassium phosphate buffer pH 7.5 at 25° C.) and is active in a pH range of 5.5 to 8.0.
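The practical meaning of the two K_(M) values can be illustrated with a short calculation. The following is a minimal sketch that assumes simple Michaelis-Menten kinetics (an assumption, not a statement from the text) and uses the K_(M) of 1×10⁻⁴ mol/l given here together with the 1.1×10⁻³ mol/l reported for cholesterol oxidase I in the background section. The function name and the chosen substrate concentration are illustrative.

```python
def fractional_rate(substrate_conc, km):
    """Return v/Vmax for a Michaelis-Menten enzyme (both arguments in mol/l)."""
    return substrate_conc / (km + substrate_conc)


KM_INVENTION = 1.0e-4  # mol/l, K_M stated for the cholesterol oxidase of the invention
KM_OXIDASE_I = 1.1e-3  # mol/l, K_M stated for cholesterol oxidase I

substrate = 1.0e-4  # mol/l cholesterol, illustrative value only

for name, km in (("invention", KM_INVENTION), ("oxidase I", KM_OXIDASE_I)):
    # Print the fraction of the maximal rate reached at this substrate level.
    print(f"{name}: v/Vmax = {fractional_rate(substrate, km):.3f}")
```

Under these assumptions the enzyme with the lower K_(M) works at half of its maximal rate at a cholesterol concentration where cholesterol oxidase I reaches only about one twelfth of its maximal rate.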

It has turned out that this cholesterol oxidase can be obtained in a large amount and in an active form when a DNA is used for a heterologous expression which codes for a peptide with cholesterol oxidase activity and is selected from the group

a) the DNA sequence shown in SEQ ID NO 1 or the DNA sequence which is complementary thereto,

b) DNA sequences which hybridize with the DNA sequence shown in SEQ ID NO 1 or with fragments of this DNA sequence,

c) DNA sequences which, without degeneracy of the genetic code, would hybridize with the sequences defined in a) or b) and which code for a polypeptide with the same amino acid sequence,

wherein this DNA has one of the sequences shown in SEQ ID NO 3, 4 and/or 5. A DNA is preferably used which has the sequence shown in SEQ ID NO 1. However, it is also possible to replace degenerate codons by other codons that code for the same amino acid in a manner familiar to a person skilled in the art. Furthermore codons coding for additional amino acids can be added at the 5' end, at the 3' end or also within the sequence shown in SEQ ID NO 1 provided the DNA variants obtained in this way hybridize with the DNA sequence shown in SEQ ID NO 1 under the usual conditions (see T. Maniatis et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 1989). In addition the DNA used should have one of the sequences shown in SEQ ID NO 3, 4 and/or 5 and code for a peptide with cholesterol oxidase activity. A peptide with cholesterol oxidase activity is understood as a peptide which catalyzes the oxidation of cholesterol (5-cholesten-3-β-ol) to 4-cholesten-3-one and H₂O₂.
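The replacement of a codon by another codon for the same amino acid can be illustrated as follows. This is a minimal sketch that assumes the standard genetic code. The helper names are illustrative, and the example DNA is the first 16 codons of SEQ ID NO 1 as printed in the sequence listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into one-letter amino acid code."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_codons(codon):
    """Return all codons that code for the same amino acid as `codon`."""
    aa = CODON_TABLE[codon.upper()]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


# First 16 codons of SEQ ID NO 1.
first_row = "TCG ACC GGG CCG GTC GCG CCG CTT CCG ACG CCG CCG AAC TTC CCG AAC"
# Replace the leucine codon CTT by the synonymous codon CTG.
variant = first_row.replace("CTT", "CTG", 1)

print(translate(first_row))
print(translate(variant) == translate(first_row))
print(synonymous_codons("CTT"))
```

Both DNA variants code for the same peptide, which is the sense in which the text allows codons to be exchanged.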

The invention therefore also concerns a DNA which codes for a peptide with cholesterol oxidase activity and is selected from the group

a) the DNA sequence shown in SEQ ID NO 1 or the DNA sequence which is complementary thereto,

b) DNA sequences which hybridize with the DNA sequence shown in SEQ ID NO 1 or with fragments of this DNA sequence,

c) DNA sequences which, without degeneracy of the genetic code, would hybridize with the sequences defined in a) or b) and which code for a polypeptide with the same amino acid sequence,

wherein this DNA has one of the sequences shown in SEQ ID NO 3, 4 and/or 5.

With such a DNA it is possible to obtain an at least 10-fold higher activity of the recombinantly produced cholesterol oxidase in a crude extract than with the previously described processes and cholesterol oxidases.

The invention in addition concerns a process for the production of a recombinant cholesterol oxidase by transformation of a suitable host cell with a DNA according to the invention which is present in a suitable expression system, culturing the transformed host cells and isolating the cholesterol oxidase formed from the cytoplasm of the transformed cells.

With this process it is surprisingly possible to obtain a recombinant cholesterol oxidase in a large amount and in an active form from the cytoplasm of the transformed host cell. In this process the DNA used can contain an additional nucleotide sequence at the 5' end which has a translation start codon but no stop codon wherein this additional nucleotide sequence does not lead to a shift in the reading frame and does not represent a functionally active signal sequence for the secretion of the recombinant enzyme formed. The length of this nucleotide sequence is about 3 to 90 base pairs.
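The formal requirements on the additional 5' nucleotide sequence (start codon, no stop codon, no frame shift, about 3 to 90 base pairs) can be checked mechanically. The following is a minimal sketch. The function name is illustrative, ATG is assumed as the only start codon, and the example sequences are hypothetical and not taken from the sequence listing. Whether a sequence acts as a functional secretion signal cannot be decided by such a check and is not tested here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_five_prime_extension(seq, start_codons=("ATG",)):
    """Return a list of violated requirements (an empty list means all checks pass)."""
    seq = seq.upper()
    problems = []
    if not 3 <= len(seq) <= 90:
        problems.append("length outside 3-90 bp")
    if len(seq) % 3 != 0:
        problems.append("length not a multiple of 3 (reading frame would shift)")
    if seq[:3] not in start_codons:
        problems.append("no start codon at the 5' end")
    # Only complete in-frame codons are examined for stop codons.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if any(codon in STOP_CODONS for codon in codons):
        problems.append("contains an in-frame stop codon")
    return problems


# Hypothetical example sequences, for illustration only.
print(check_five_prime_extension("ATGTCTAACCACCACCAC"))
print(check_five_prime_extension("ATGTAACAC"))
print(check_five_prime_extension("ATGTC"))
```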

The additional nucleotide sequence preferably has one of the sequences shown in the sequence protocols 6, 8, 10, 12, 14 and 16 instead of the native signal sequence.

A preferred subject matter of the invention is therefore a process for the production of a recombinant cholesterol oxidase in which a DNA according to the invention is used which has one of the sequences shown in SEQ ID NO 6, 8, 10, 12, 14 or 16 at the 5' end.

The host cells used for the recombinant production are transformed according to known methods (see e.g. Sambrook, Fritsch and Maniatis, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor 1989). The transformed host cells are then cultured under conditions which allow expression of the cholesterol oxidase gene. Depending on the expression vector used, it may be expedient in a well-known manner to add an inducer (e.g. lactose or isopropyl-β-D-thiogalactopyranoside (IPTG)) to the culture medium, to increase the temperature or to limit the supply of glucose. Isolation of the recombinant cholesterol oxidase from the cytoplasm of the transformed cells is then achieved according to known methods.

With this process it is possible to obtain the cholesterol oxidase according to the invention as a recombinant enzyme in a yield of 8-20 U/ml. In contrast, expression of the complete cholesterol oxidase gene which contains the signal sequence only results in a yield of less than 0.1 U/ml.

A preferred subject matter of the invention is a DNA according to the invention coding for a peptide with cholesterol oxidase activity which has one of the sequences shown in SEQ ID NO 6, 8, 10, 12, 14 and 16 at the 5' end. The sequences shown in the sequence protocols 18, 20, 22, 24, 26 and 28 are particularly preferred. These DNA sequences according to the invention are preferably present cloned in an expression vector. This DNA can be used to obtain the cholesterol oxidase according to the invention in any amount in bacteria that are conventionally used for the recombinant production of proteins. The expression is preferably carried out in E. coli.

The invention therefore also concerns a recombinant cholesterol oxidase which is coded by a DNA according to the invention and has one of the amino acid sequences shown in SEQ ID NO 7, 9, 11, 13, 15 or 17 at the N-terminal end.

This recombinant cholesterol oxidase is equally as suitable as the other cholesterol oxidases known from the state of the art for an enzymatic test for the determination of cholesterol. If necessary recognition sequences for specific proteases such as e.g. IgA protease, enterokinase or factor Xa can be integrated between these N-terminal sequences and the amino acid sequence of the mature cholesterol oxidase by in vitro mutagenesis in a manner familiar to a person skilled in the art so that even after cytoplasmic expression of a cholesterol oxidase extended by these N-terminal sequences it is possible to cleave off such fused N-terminal sequences.

A preferred subject matter of the invention is a recombinant cholesterol oxidase which has the amino acid sequence shown in SEQ ID NO 21, 23, 25, 27 or 29 as well as the use of such a recombinant cholesterol oxidase in an enzymatic test for the detection of cholesterol. In this process the H₂O₂ formed in the cholesterol oxidase reaction is preferably determined in a subsequent indicator reaction as a measure of the cholesterol present in the sample.

The plasmids pUC-chol-B2-BB (DSM 8274), pmgl-SphI (DSM 8272) and pfl-20AT1-SD (DSM 8273) mentioned in the examples were deposited on May 05, 1993 at the "Deutsche Sammlung fur Zellkulturen und Mikroorganismen GmbH", Mascheroder Weg 1b, D-3300 Braunschweig.

The application is elucidated in more detail by the following examples in conjunction with the sequence protocols and figures.

SEQ ID NO 1 shows the nucleic acid sequence of the cholesterol oxidase according to the invention.

SEQ ID NO 2 shows the amino acid sequence of the cholesterol oxidase according to the invention.

SEQ ID NOS 3-5 show nucleotide sequences from DNAs according to the invention coding for a peptide with cholesterol oxidase activity.

SEQ ID NOS 6-17 show the N-terminal sequences of recombinant cholesterol oxidase genes according to the invention (SEQ ID NOS 6, 8, 10, 12, 14 and 16) and the N-terminal amino acid sequences thereof (SEQ ID NOS 7, 9, 11, 13, 15 and 17). SEQ ID NOS 18-29 show the nucleic acid sequences and amino acid sequences thereof of cholesterol oxidases according to the invention.

They denote the following:

    ______________________________________________________________
    Signal sequence    Complete sequence    Construct
    ______________________________________________________________
    SEQ ID NO 6-7      SEQ ID NO 18-19      plac-Chol-cyt
    SEQ ID NO 8-9      SEQ ID NO 20-21      ppfl-Chol-cyt
    SEQ ID NO 10-11    SEQ ID NO 22-23      ppfl-MSN3H-Chol-cyt
    SEQ ID NO 12-13    SEQ ID NO 24-25      ppfl-MSN4H-Chol-cyt
    SEQ ID NO 14-15    SEQ ID NO 26-27      ppfl-MSN4R2K-Chol-cyt
    SEQ ID NO 16-17    SEQ ID NO 28-29      ppfl-MVM3H-Chol-cyt
    ______________________________________________________________

SEQ ID NOS 30-33 show four oligonucleotides for amplification of a fragment of the cholesterol oxidase gene according to the invention.

SEQ ID NO 34 shows the sequence of an adapter oligonucleotide for the in vitro mutagenesis of the cholesterol oxidase gene according to example 5.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the plasmid pUC-Chol-B2-BB.

FIG. 2 shows the plasmid plac-Chol-cyt.

FIG. 3 shows the plasmid ppfl-Chol-cyt.

FIG. 4 shows the plasmid ppfl-MSN3H-Chol-cyt.

EXAMPLE 1 Cloning of the gene for cholesterol oxidase from Brevibacterium sterolicum

Brevibacterium sterolicum (BMTU 2407) is cultured in 500 ml nutrient broth (Difco) for 20 h at 30° C. The cells are harvested by centrifugation. The cell mass obtained in this way is resuspended in 20 mmol/l Tris/HCl pH 8.0 to 0.4 g cell wet weight/ml. 5 ml 24% (w/v) polyethylene glycol 6000, 2.5 ml 20 mmol/l Tris/HCl pH 8.0 and 10 mg lysozyme are added to 2.5 ml of this suspension and it is incubated for 14 h at 4° C. Then the cells are lysed by addition of 1 ml 20% (w/v) SDS and 2 mg protease K and incubation for 1 h at 37° C. An equal volume of 20 mmol/l Tris/HCl pH 8.0 is added to this solution and then 1 g CsCl and 0.8 g ethidium bromide are added per ml. This solution is separated by ultracentrifugation for 24 h at 40,000 rpm in a TV850 vertical rotor (DuPont). The DNA band is then withdrawn with an injection syringe. The removal of ethidium bromide and ethanol precipitation of the DNA is carried out as described in Sambrook et al., Molecular Cloning, A Laboratory Manual (1989).

7 μg of the DNA obtained in this manner is partially cleaved with the restriction endonuclease NlaIII (New England Biolabs), separated electrophoretically on a 0.8% agarose gel and a size region of ca. 2-12 kb is cut out. The DNA fragments are isolated from the gel, cleaved with SphI and subsequently ligated into a plasmid vector pUC19 treated with alkaline phosphatase from calf intestine. This ligation preparation is transformed into competent E. coli K12 XL1-blue (Stratagene, Catalogue No. 200268). The transformed cells are plated on agar plates with LB medium containing 100 μg/ml ampicillin and incubated overnight at 37° C. The fully grown colonies are transferred onto nitrocellulose filters (Schleicher & Schuell), lysed by treatment with toluene/chloroform vapour and the colony side of the filter is transferred onto indicator plates (see below). Cholesterol oxidase activity is tested on these indicator plates by a 15- to 30-minute incubation at room temperature.

Clones which show a colour reaction are selected and isolated. As a control these E. coli clones are streaked onto an agar plate with LB medium containing 100 μg/ml ampicillin and incubated overnight at 37° C. For verification the colonies that have grown are again transferred onto two different nitrocellulose filters and lysed as described above with toluene/chloroform vapour. One filter is again placed on one of the indicator plates described above and the other filter is placed on an indicator plate without cholesterol. A positive colour reaction was only seen on the complete indicator plates containing the substrate cholesterol. This demonstrates that the colour reaction caused by the corresponding E. coli clone is in fact due to active cholesterol oxidase.

Preparation of the indicator plates:

For the plate test to determine cholesterol oxidase activity, 100 ml 2% low-melting-point agarose (Sea Plaque BIOzym 50113) is melted and a solution of:

48 mg 4-aminoantipyrine (Boehringer Mannheim GmbH, Catalogue No. 073474)

306 mg EST (N-ethyl-N-sulfoethyl-3-methylaniline potassium salt (Boehringer Mannheim GmbH, Catalogue No. 586854))

2.5 mg horseradish peroxidase, degree of purity II (ca. 260 U/mg (Boehringer Mannheim GmbH, Catalogue No. 005096))

60 μl sodium azide solution (20%)

10 ml 1 mol/l potassium phosphate pH 7.2

150 mg cholic acid sodium salt (Merck, Catalogue No. 12448)

10 ml cholesterol substrate solution (see below)

H₂O to a volume of 100 ml

pre-warmed to a temperature of 42° C. is added to the melted agarose, carefully mixed, 10 ml portions are poured into Petri dishes and kept in the dark for storage.

Cholesterol substrate solution:

500 mg cholesterol (Boehringer Mannheim GmbH, Catalogue No. 121312) is dissolved in 12.5 ml 1-propanol (Merck, Catalogue No. 997), mixed well after addition of 10 g Thesit (Boehringer Mannheim GmbH, Catalogue No. 006190) and water is added to a volume of 100 ml. The substrate solution can be stored for several months at room temperature.
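The concentrations that result from these recipes can be worked out as follows. This is a minimal sketch under two assumptions: the 100 ml indicator solution is combined with the 100 ml melted agarose to give 200 ml (volumes taken as additive), and the molar mass of cholesterol is 386.65 g/mol (a literature value, not stated in the text). All variable names are illustrative.

```python
CHOLESTEROL_MOLAR_MASS = 386.65  # g/mol, literature value (assumption, not from the text)

# Cholesterol substrate solution: 500 mg cholesterol made up to 100 ml.
stock_mg_per_ml = 500.0 / 100.0  # 5 mg/ml, i.e. 0.5% (w/v)
stock_mmol_per_l = stock_mg_per_ml / CHOLESTEROL_MOLAR_MASS * 1000.0

# Indicator solution: 10 ml substrate solution made up to 100 ml.
solution_mg_per_ml = stock_mg_per_ml * 10.0 / 100.0

# Plate mixture: 100 ml indicator solution + 100 ml 2% agarose (assumed additive).
plate_volume_ml = 100.0 + 100.0
plate_mg_per_ml = solution_mg_per_ml * 100.0 / plate_volume_ml
plate_agarose_percent = 2.0 * 100.0 / plate_volume_ml

print(f"substrate solution: {stock_mg_per_ml:.1f} mg/ml ({stock_mmol_per_l:.1f} mmol/l)")
print(f"indicator plate: {plate_mg_per_ml:.2f} mg/ml cholesterol, "
      f"{plate_agarose_percent:.1f}% agarose")
```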

EXAMPLE 2 Characterization of the cholesterol oxidase gene

The plasmid of a clone obtained according to example 1 (pUC-chol-B2) is isolated according to standard methods and subjected to restriction mapping using the restriction endonucleases BamHI, EcoRI, KpnI, XhoI, PstI. It turns out that a DNA fragment from the genome of Brevibacterium with a size of ca. 5.5 kb is inserted into the plasmid pUC-Chol-B2. By subcloning various partial fragments of this 5.5 kb piece and subsequently determining the cholesterol oxidase activity of the E. coli clones obtained, it is possible to narrow down the cholesterol oxidase gene to a BamHI fragment of 2.3 kb size. The plasmid with this fragment is denoted pUC-Chol-B2-BB (FIG. 1). The DNA sequence of this fragment is determined and examined for a reading frame which codes for cholesterol oxidase. The sequence of this reading frame for mature cholesterol oxidase is given in SEQ ID NO 1.

EXAMPLE 3 Construction of a plasmid for expressing the cholesterol oxidase gene with a heterologous signal sequence

Comparison of the N-terminal amino acid sequence of cholesterol oxidase which was isolated from Brevibacterium with the entire reading frame coding for cholesterol oxidase from pUC-Chol-B2-BB shows that the first 52 coded amino acids of the gene sequence are absent in the mature protein. These 52 amino acids have the structure of a typical export signal sequence of gram-positive prokaryotes (von Heijne, Biochim. Biophys. Acta 947 (1988), 307-333). In order to construct recombinant cholesterol oxidase genes in which this signal sequence is replaced by other sequences, a 387 bp DNA fragment from the plasmid pUC-Chol-B2-BB is firstly amplified by means of PCR using the oligonucleotides shown in SEQ ID NOS 30 and 31. This fragment contains the region coding for the N-terminal part of the mature oxidase with a new SphI cleavage site directly in front of the N-terminus of the amino acid sequence of the mature enzyme. This PCR fragment is cleaved with SphI and PstI and ligated together with a PstI/EcoRI fragment from pUC-Chol-B2-BB, which contains the remaining part of the cholesterol oxidase gene, into the expression vector pmgl-SphI cleaved with SphI and EcoRI. In this way the vector pmgl-Chol-SB is obtained. In this vector the cholesterol oxidase gene contains a signal sequence from Salmonella typhimurium that is functional in E. coli (described in WO 88/093773).

EXAMPLE 4 Construction of a plasmid for expression of the cholesterol oxidase gene without a signal peptide-coding sequence under the control of the lacUV5 promoter

A DNA fragment of ca. 1.85 kb in size which contains the entire part of the coding sequence of mature cholesterol oxidase but not the sequence coding for the signal peptide is cut out of the plasmid pmgl-Chol-SB by treatment with the restriction endonucleases SphI and BamHI. This fragment is inserted into the plasmid vector pUC19 which has previously been cleaved with SphI and BamHI. In the plasmid plac-Chol-cyt obtained in this manner the cholesterol oxidase gene is present in the correct reading frame and is fused to the first ten codons of the lacZ' gene from pUC19 and is under the control of the lacUV5 promoter (FIG. 2).

EXAMPLE 5 Construction of a plasmid for the expression of the cholesterol oxidase gene without a signal peptide-coding sequence under the control of the oxygen-regulated pfl promoter

A DNA fragment of 432 bp in size which contains a ClaI cleavage site in front of the ATG start codon is produced from the plasmid plac-Chol-cyt by the PCR technique using the oligonucleotides shown in SEQ ID NOS 32 and 33. This PCR fragment is cut with ClaI and PstI. In addition a fragment with the remaining C-terminal part of the cholesterol oxidase gene is cleaved from the plasmid plac-Chol-cyt by treatment with the restriction endonucleases PstI and BamHI. Both fragments are simultaneously ligated into the expression vector pfl-20AT1-SD cleaved with BamHI and ClaI. The correct ligation product now contains the reading frame of mature cholesterol oxidase fused to the first ten codons of the lacZ' gene from pUC19 under the control of the oxygen-regulated pfl promoter (FIG. 3). This plasmid is denoted ppfl-Chol-cyt.

EXAMPLE 6 Construction of a plasmid for expressing the cholesterol oxidase gene with an alternative N-terminal fusion sequence

In order to remove the SphI cleavage site of the plasmid ppfl-Chol-cyt located in the 3' untranslated region of the cholesterol oxidase gene, the plasmid DNA is cleaved with SmaI and EcoRV and again religated. 100 ng of the plasmid ppfl-Chol-cyt-Δterm formed in this manner is then cleaved with the restriction enzymes ClaI and SphI. The DNA fragment of 4.76 kb in size which is formed is electrophoretically separated in low-melting point agarose, cleaved and eluted (Glassmilk®-Kit, Bio 101). 100 ng of the DNA fragment purified in this manner is admixed with 50 pmol of an adapter oligonucleotide with the sequence shown in SEQ ID NO 34 (in which "N" denotes an equimolar mixture of all 4 bases) and treated for 2 hours at 37° C. with T4 DNA ligase. Subsequently the mixture is admixed with a mixture of 4 dNTP's (final concentration 0.125 mmol/l) and treated for 40 minutes at 37° C. with Klenow DNA polymerase. The plasmid DNA obtained in this manner is transformed in E. coli XL1-blue (Stratagene). Individual colonies of the clones obtained are compared with the aid of the colony activity test described in example 1 with regard to their cholesterol oxidase activity. Clones with a high cholesterol oxidase activity are isolated and the plasmid DNA is characterized by restriction analysis and DNA sequencing. The plasmid of a clone with a particularly high cholesterol oxidase activity was found to have the sequence SEQ ID NO 23. The plasmid concerned is denoted ppfl-MSM3H-Chol-cyt-Δterm. It is to be expected that further clones suitable for a particularly high expression may be found in the described manner after isolation and characterization of an adequate number of different clones. In order to complete again the 3' untranslated part, the plasmid ppfl-MSM3H-Chol-cyt-Δterm is cleaved with ClaI and XhoI. A DNA fragment of ca. 1.1 kb with the translation initiation region and the N-terminal part of the cholesterol oxidase gene is isolated and ligated into the plasmid ppfl-Chol-cyt which is also cleaved with ClaI and XhoI (FIG. 4). The plasmid obtained is denoted ppfl-MSN3H-Chol-cyt.
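Because "N" in the adapter oligonucleotide stands for an equimolar mixture of all four bases, the number of different plasmids that can arise grows as 4 to the power of the number of N positions. The following minimal sketch enumerates the variants of a degenerate oligonucleotide. The example oligonucleotide is hypothetical and is not SEQ ID NO 34, and the function name is illustrative.

```python
from itertools import product


def expand_degenerate(oligo):
    """Return every concrete sequence encoded by an oligo in which N means A, C, G or T."""
    pools = ["ACGT" if base == "N" else base for base in oligo.upper()]
    return ["".join(bases) for bases in product(*pools)]


hypothetical_adapter = "ATGNN"  # illustration only, not the adapter of SEQ ID NO 34
variants = expand_degenerate(hypothetical_adapter)

print(len(variants))  # 4 ** (number of N positions)
print(variants[:4])
```

This is why screening a sufficient number of individual colonies, as described above, is needed to find clones with particularly high expression.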

EXAMPLE 7 Comparison of the formation of cholesterol oxidase by the various expression plasmids in E. coli

The plasmids pUC-Chol-B2, pUC-Chol-B2-BB, pmgl-Chol-SB, plac-Chol-cyt, ppfl-Chol-cyt and ppfl-MSN3H-Chol-cyt are each transformed into E. coli K12 XL1-blue. In order to compare the amount of enzyme formed, the clones are each cultured for 15 hours at 30° C. in LB medium containing 200 μg/ml ampicillin and the following further additives:

Clones containing the plasmids pUC-Chol-B2, pUC-Chol-B2-BB or plac-Chol-cyt, in which the cholesterol oxidase gene is in each case under the control of the lacUV5 promoter, additionally receive 1 mmol/l IPTG. The clone containing the plasmid pmgl-Chol-SB with the glucose-repressed mgl promoter receives no further additives. Clones containing the plasmids ppfl-Chol-cyt or ppfl-MSN3H-Chol-cyt with the oxygen-regulated pfl promoter receive 0.4% glucose and are grown in closed serum flasks that have been gassed with nitrogen and in which the medium was adjusted with KOH to pH 7.0. After the culture is completed the cell density achieved is determined by photometric measurement of the turbidity at 420 nm. The cells of 1 ml culture broth are then sedimented by centrifugation in a microcentrifuge at 10,000 g and resuspended in 0.5 ml redistilled H₂O. The cells are disrupted by 2×30 seconds ultrasonic treatment (Branson Sonifier, model 450, standard microtip, conical). The cell extracts obtained in this manner are used in the following enzyme test after appropriate dilution. For this the following are pipetted into quartz cuvettes:

3 ml potassium phosphate buffer (0.5 mol/l, pH 7.5) containing 0.4% Thesit® (Boehringer Mannheim GmbH, Catalogue No. 006190),

0.1 ml cholesterol solution (0.4% cholesterol, 10% 1-propanol, 10% Thesit®),

0.02 ml H₂O₂ (0.49 mol/l in redistilled water).

The solution is mixed. After addition of 0.02 ml catalase (from bovine liver, 20 mg protein/ml, specific activity ca. 65,000 U/mg, Boehringer Mannheim GmbH, Catalogue No. 0156744, diluted immediately before measurement to 0.075-0.15 U/ml with ice-cold potassium phosphate buffer containing 0.4% Thesit) it is mixed again, the solution is brought to a temperature of 25° C. and the reaction is then started by addition of 0.05 ml sample solution. After careful mixing the change in absorbance at 240 nm is monitored and the activity of cholesterol oxidase is determined from the linear region of the absorbance curve:

activity (U/ml sample solution) = (V × ΔA/min) / (ε₂₄₀ × d × v)

in which V is the total assay volume (3.19 ml), v is the sample volume (0.05 ml), d is the light path of the cuvette in cm and ε₂₄₀ = 15.5 mmol⁻¹ × l × cm⁻¹.
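A minimal numerical sketch of the activity calculation follows. It assumes the usual Beer-Lambert form of a photometric rate assay (activity = ΔA/min × total volume / (ε × light path × sample volume)), a light path of 1 cm, and a total assay volume of 3.19 ml obtained by summing the pipetted volumes (3 + 0.1 + 0.02 + 0.02 + 0.05 ml). The function name, the dilution parameter and the example ΔA/min value are illustrative.

```python
EPSILON_240 = 15.5      # l x mmol^-1 x cm^-1, as given in the text
TOTAL_VOLUME_ML = 3.19  # sum of the pipetted volumes (assumption: volumes additive)
SAMPLE_VOLUME_ML = 0.05
LIGHT_PATH_CM = 1.0     # assumed standard cuvette


def volume_activity(delta_a_per_min, dilution_factor=1.0):
    """Return cholesterol oxidase activity in U per ml of undiluted sample.

    One unit (U) is taken as 1 micromole of substrate converted per minute.
    """
    per_ml_assayed = (delta_a_per_min * TOTAL_VOLUME_ML) / (
        EPSILON_240 * LIGHT_PATH_CM * SAMPLE_VOLUME_ML
    )
    return per_ml_assayed * dilution_factor


# Illustrative reading: 0.1 absorbance units per minute from a 10-fold diluted extract.
print(round(volume_activity(0.1, dilution_factor=10.0), 3))
```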

The values obtained for cell density and enzyme activity are shown in Table 1.

                  TABLE 1
    ________________________________________________________________________
                            Cell density    Units per
    Clone/plasmid           (A 420)         cell density    Units per ml
    ________________________________________________________________________
    pUC-chol-B2             7.0             0.007           0.049
    pUC-chol-B2-BB          8.4             0.068           0.571
    pmgl-chol-SB            1.3             0.014           0.018
    plac-chol-cyt           8.6             0.725           6.235
    ppfl-chol-cyt           1.25            1.675           2.094
    ppfl-MSN3H-chol-cyt     3.7             1.463           5.413
    ________________________________________________________________________

The results obtained show that using such constructs which cause a cytoplasmic expression of cholesterol oxidase, a considerably higher activity of the recombinantly produced cholesterol oxidase can be obtained than with those constructs which lead to a secretion of the recombinantly produced cholesterol oxidase.
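The relation between the columns of Table 1 and the size of the improvement can be checked with a few lines. This is a minimal sketch that takes "Units per ml" as the product of cell density and units per cell density, and groups the constructs into secreting and cytoplasmic ones according to the descriptions in examples 2 to 6. The variable names and this grouping are a reading of the examples, not part of the table.

```python
# (cell density A420, units per cell density, units per ml) as printed in Table 1
TABLE_1 = {
    "pUC-chol-B2": (7.0, 0.007, 0.049),
    "pUC-chol-B2-BB": (8.4, 0.068, 0.571),
    "pmgl-chol-SB": (1.3, 0.014, 0.018),
    "plac-chol-cyt": (8.6, 0.725, 6.235),
    "ppfl-chol-cyt": (1.25, 1.675, 2.094),
    "ppfl-MSN3H-chol-cyt": (3.7, 1.463, 5.413),
}
# Grouping assumed from the examples: constructs that retain a signal sequence.
SECRETING = {"pUC-chol-B2", "pUC-chol-B2-BB", "pmgl-chol-SB"}

for name, (density, per_density, per_ml) in TABLE_1.items():
    # The printed value should equal the product up to rounding.
    print(f"{name}: {density * per_density:.3f} calculated, {per_ml} printed")

best_secreting = max(v[2] for k, v in TABLE_1.items() if k in SECRETING)
best_cytoplasmic = max(v[2] for k, v in TABLE_1.items() if k not in SECRETING)
fold_increase = best_cytoplasmic / best_secreting
print(f"best cytoplasmic / best secreting: {fold_increase:.1f}-fold")
```

With the printed values, the best cytoplasmic construct yields roughly eleven times the activity per ml of the best construct that retains a signal sequence.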

    __________________________________________________________________________
SEQUENCE LISTING

(1) GENERAL INFORMATION:
    (iii) NUMBER OF SEQUENCES: 34

(2) INFORMATION FOR SEQ ID NO: 1:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1683 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 1..1683
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TCG ACC GGG CCG GTC GCG CCG CTT CCG ACG CCG CCG AAC TTC CCG AAC    48
Ser Thr Gly Pro Val Ala Pro Leu Pro Thr Pro Pro Asn Phe Pro Asn
                                                        15

GAC ATC GCG CTG TTC CAG CAG GCG TAC CAG AAC TGG TCC AAG GAG ATC    96
Asp Ile Ala Leu Phe Gln Gln Ala Tyr Gln Asn Trp Ser Lys Glu Ile
                                                    30

ATG CTG GAC GCC ACT TGG GTC TGC TCG CCC AAG ACG CCG CAG GAT GTC   144
Met Leu Asp Ala Thr Trp Val Cys Ser Pro Lys Thr Pro Gln Asp Val
                                                45

GTT CGC CTT GCC AAC TGG GCG CAC GAG CAC GAC TAC AAG ATC CGC CCG   192
Val Arg Leu Ala Asn Trp Ala His Glu His Asp Tyr Lys Ile Arg Pro
                                            60

CGC GGC GCG ATG CAC GGC TGG ACC CCG CTC ACC GTG GAG AAG GGG GCC   240
Arg Gly Ala Met His Gly Trp Thr Pro Leu Thr Val Glu Lys Gly Ala
                                                            80

AAC GTC GAG AAG GTG ATC CTC GCC GAC ACG ATG ACG CAT CTG AAC GGC   288
Asn Val Glu Lys Val Ile Leu Ala Asp Thr Met Thr His Leu Asn Gly
                                                        95

ATC ACG GTG AAC ACG GGC GGC CCC GTG GCT ACC GTC ACC GCC GGT GCC   336
Ile Thr Val Asn Thr Gly Gly Pro Val Ala Thr Val Thr Ala Gly Ala
                                                    110

GGC GCC AGC ATC GAG GCG ATC GTC ACC GAA CTG CAG AAG CAC GAC CTC   384
Gly Ala Ser Ile Glu Ala Ile Val Thr Glu Leu Gln Lys His Asp Leu
                                                125

GGC TGG GCC AAC CTG CCC GCT CCG GGT GTG CTG TCG ATC GGT GGC GCC   432
Gly Trp Ala Asn Leu Pro Ala Pro Gly Val Leu Ser Ile Gly Gly Ala
                                            140

CTT GCG GTC AAC GCG CAC GGT GCG GCG CTG CCG GCC GTC GGC CAG ACC   480
Leu Ala Val Asn Ala His Gly Ala Ala Leu Pro Ala Val Gly Gln Thr
145                 150                 155                 160

ACG CTG CCC GGT CAC ACC TAC GGT TCG CTG AGC AAC CTG GTC ACC GAG   528
Thr Leu Pro Gly His Thr Tyr Gly Ser Leu Ser Asn Leu Val Thr Glu
                                                        175

CTG ACC GCG GTC GTC TGG AAC GGC ACC ACC TAC GCA CTC GAG ACG TAC   576
Leu Thr Ala Val Val Trp Asn Gly Thr Thr Tyr Ala Leu Glu Thr Tyr
                                                    190

CAG CGC AAC GAT CCT CGG ATC ACC CCA CTG CTC ACC AAC CTC GGG CGC   624
Gln Arg Asn Asp Pro Arg Ile Thr Pro Leu Leu Thr Asn Leu Gly Arg
                                                205

TGC TTC CTG ACC TCG GTG ACG ATG CAG GCC GGC CCC AAC TTC CGT CAG   672
Cys Phe Leu Thr Ser Val Thr Met Gln Ala Gly Pro Asn Phe Arg Gln
                                            220

CGG TGC CAG AGC TAC ACC GAC ATC CCG TGG CGG GAA CTG TTC GCG CCG   720
Arg Cys Gln Ser Tyr Thr Asp Ile Pro Trp Arg Glu Leu Phe Ala Pro
225                 230                 235                 240

AAG GGC GCC GAC GGC CGC ACG TTC GAG AAG TTC GTC GCG GAA TCG GGC   768
Lys Gly Ala Asp Gly Arg Thr Phe Glu Lys Phe Val Ala Glu Ser Gly
                                                        255

GGC GCC GAG GCG ATC TGG TAC CCG TTC ACC GAG AAG CCG TGG ATG AAG   816
Gly Ala Glu Ala Ile Trp Tyr Pro Phe Thr Glu Lys Pro Trp Met Lys
                                                    270

GTG TGG ACG GTC TCG CCG ACC AAG CCG GAC TCG TCG AAC GAG GTC GGA   864
Val Trp Thr Val Ser Pro Thr Lys Pro Asp Ser Ser Asn Glu Val Gly
                                                285

AGC CTC GGC TCG GCG GGC TCC CTC GTC GGC AAG CCT CCG CAG GCG CGT   912
Ser Leu Gly Ser Ala Gly Ser Leu Val Gly Lys Pro Pro Gln Ala Arg
                                            300

GAG GTC TCC GGC CCG TAC AAC TAC ATC TTC TCC GAC AAC CTG CCG GAG   960
Glu Val Ser Gly Pro Tyr Asn Tyr Ile Phe Ser Asp Asn Leu Pro Glu
305                 310                 315                 320

CCC ATC ACC GAC ATG ATC GGC GCC ATC AAC GCC GGA AAC CCC GGA ATC  1008
Pro Ile Thr Asp Met Ile Gly Ala Ile Asn Ala Gly Asn Pro Gly Ile
                                                        335

GCA CCG CTG TTC GGC CCG GCG ATG TAC GAG ATC ACC AAG CTC GGG CTG  1056
Ala Pro Leu Phe Gly Pro Ala Met Tyr Glu Ile Thr Lys Leu Gly Leu
                                                    350

GCC GCG ACG AAT GCC AAC GAC ATC TGG GGC TGG TCG AAG GAC GTC CAG  1104
Ala Ala Thr Asn Ala Asn Asp Ile Trp Gly Trp Ser Lys Asp Val Gln
                                                365

TTC TAC ATC AAG GCC ACG ACG TTG CGA CTC ACC GAG GGC GGC GGC GCC  1152
Phe Tyr Ile Lys Ala Thr Thr Leu Arg Leu Thr Glu Gly Gly Gly Ala
                                            380

GTC GTC ACG AGC CGC GCC AAC ATC GCG ACC GTG ATC AAC GAC TTC ACC  1200
Val Val Thr Ser Arg Ala Asn Ile Ala Thr Val Ile Asn Asp Phe Thr
385                 390                 395                 400

GAG TGG TTC CAC GAG CGC ATC GAG TTC TAC CGC GCG AAG GGC GAG TTC  1248
Glu Trp Phe His Glu Arg Ile Glu Phe Tyr Arg Ala Lys Gly Glu Phe
                                                        415

CCG CTC AAC GGT CCG GTC GAG ATC CGC TGC TGC GGG CTC GAT CAG GCA  1296
Pro Leu Asn Gly Pro Val Glu Ile Arg Cys Cys Gly Leu Asp Gln Ala
                                                    430

GCC GAC GTC AAG GTG CCG TCG GTG GGC CCG CCG ACC ATC TCG GCG ACC  1344
Ala Asp Val Lys Val Pro Ser Val Gly Pro Pro Thr Ile Ser Ala Thr
                                                445

CGT CCG CGT CCG GAT CAT CCG GAC TGG GAC GTC GCG ATC TGG CTG AAC  1392
Arg Pro Arg Pro Asp His Pro Asp Trp Asp Val Ala Ile Trp Leu Asn
                                            460

GTT CTC GGT GTT CCG GGC ACC CCC GGC ATG TTC GAG TTC TAC CGC GAG  1440
Val Leu Gly Val Pro Gly Thr Pro Gly Met Phe Glu Phe Tyr Arg Glu
465                 470                 475                 480

ATG GAG CAG TGG ATG CGG AGC CAC TAC AAC AAC GAC GAC GCC ACC TTC  1488
Met Glu Gln Trp Met Arg Ser His Tyr Asn Asn Asp Asp Ala Thr Phe
                                                        495

CGG CCC GAG TGG TCG AAG GGG TGG GCG TTC GGT CCC GAC CCG TAC ACC  1536
Arg Pro Glu Trp Ser Lys Gly Trp Ala Phe Gly Pro Asp Pro Tyr Thr
                                                    510

GAC AAC GAC ATC GTC ACG AAC AAG ATG CGC GCC ACC TAC ATC GAA GGT  1584
Asp Asn Asp Ile Val Thr Asn Lys Met Arg Ala Thr Tyr Ile Glu Gly
                                                525

GTC CCG ACG ACC GAG AAC TGG GAC ACC GCG CGC GCT CGG TAC AAC CAG  1632
Val Pro Thr Thr Glu Asn Trp Asp Thr Ala Arg Ala Arg Tyr Asn Gln
                                            540

ATC GAC CCG CAT CGC GTG TTC ACC AAC GGA TTC ATG GAC AAG CTG CTT  1680
Ile Asp Pro His Arg Val Phe Thr Asn Gly Phe Met Asp Lys Leu Leu
545                 550                 555                 560

                                                                 1683
Pro

(2) INFORMATION FOR SEQ ID NO: 2:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 561 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Ser Thr Gly Pro Val Ala Pro Leu Pro Thr Pro Pro Asn Phe Pro Asn
                                                        15

Asp Ile Ala Leu Phe Gln Gln Ala Tyr Gln Asn Trp Ser Lys Glu Ile
                                                    30

Met Leu Asp Ala Thr Trp Val Cys Ser Pro Lys Thr Pro Gln Asp Val
                                                45

Val Arg Leu Ala Asn Trp Ala His Glu His Asp Tyr Lys Ile Arg Pro
                                            60

Arg Gly Ala Met His Gly Trp Thr Pro Leu Thr Val Glu Lys Gly Ala
                                                            80

Asn Val Glu Lys Val Ile Leu Ala Asp Thr Met Thr His Leu Asn Gly
                                                        95

Ile Thr Val Asn Thr Gly Gly Pro Val Ala Thr Val Thr Ala Gly Ala
                                                    110

Gly Ala Ser Ile Glu Ala Ile Val Thr Glu Leu Gln Lys His Asp Leu
                                                125

Gly Trp Ala Asn Leu Pro Ala Pro Gly Val Leu Ser Ile Gly Gly Ala
                                            140

Leu Ala Val Asn Ala His Gly Ala Ala Leu Pro Ala Val Gly Gln Thr
145                 150                 155                 160

Thr Leu Pro Gly His Thr Tyr Gly Ser Leu Ser Asn Leu Val Thr Glu
                                                        175

Leu Thr Ala Val Val Trp Asn Gly Thr Thr Tyr Ala Leu Glu Thr Tyr
                                                    190

Gln Arg Asn Asp Pro Arg Ile Thr Pro Leu Leu Thr Asn Leu Gly Arg
                                                205

Cys Phe Leu Thr Ser Val Thr Met Gln Ala Gly Pro Asn Phe Arg Gln
                                            220

Arg Cys Gln Ser Tyr Thr Asp Ile Pro Trp Arg Glu Leu Phe Ala Pro
225                 230                 235                 240

Lys Gly Ala Asp Gly Arg Thr Phe Glu Lys Phe Val Ala Glu Ser Gly
                                                        255

Gly Ala Glu Ala Ile Trp Tyr Pro Phe Thr Glu Lys Pro Trp Met Lys
                                                    270

Val Trp Thr Val Ser Pro Thr Lys Pro Asp Ser Ser Asn Glu Val Gly
                                                285

Ser Leu Gly Ser Ala Gly Ser Leu Val Gly Lys Pro Pro Gln Ala Arg
                                            300

Glu Val Ser Gly Pro Tyr Asn Tyr Ile Phe Ser Asp Asn Leu Pro Glu
305                 310                 315                 320

Pro Ile Thr Asp Met Ile Gly Ala Ile Asn Ala Gly Asn Pro Gly Ile
                                                        335

Ala Pro Leu Phe Gly Pro Ala Met Tyr Glu Ile Thr Lys Leu Gly Leu
                                                    350

Ala Ala Thr Asn Ala Asn Asp Ile Trp Gly Trp Ser Lys Asp Val Gln
                                                365

Phe Tyr Ile Lys Ala Thr Thr Leu Arg Leu Thr Glu Gly Gly Gly Ala
                                            380

Val Val Thr Ser Arg Ala Asn Ile Ala Thr Val Ile Asn Asp Phe Thr
385                 390                 395                 400

Glu Trp Phe His Glu Arg Ile Glu Phe Tyr Arg Ala Lys Gly Glu Phe
                                                        415

Pro Leu Asn Gly Pro Val
Glu Ile Arg Cys Cy - #s Gly Leu Asp Gln Ala     #           430     - Ala Asp Val Lys Val Pro Ser Val Gly Pro Pr - #o Thr Ile Ser Ala Thr     #       445     - Arg Pro Arg Pro Asp His Pro Asp Trp Asp Va - #l Ala Ile Trp Leu Asn     #   460     - Val Leu Gly Val Pro Gly Thr Pro Gly Met Ph - #e Glu Phe Tyr Arg Glu     465                 4 - #70                 4 - #75                 4 -     #80     - Met Glu Gln Trp Met Arg Ser His Tyr Asn As - #n Asp Asp Ala Thr Phe     #               495     - Arg Pro Glu Trp Ser Lys Gly Trp Ala Phe Gl - #y Pro Asp Pro Tyr Thr     #           510     - Asp Asn Asp Ile Val Thr Asn Lys Met Arg Al - #a Thr Tyr Ile Glu Gly     #       525     - Val Pro Thr Thr Glu Asn Trp Asp Thr Ala Ar - #g Ala Arg Tyr Asn Gln     #   540     - Ile Asp Pro His Arg Val Phe Thr Asn Gly Ph - #e Met Asp Lys Leu Leu     545                 5 - #50                 5 - #55                 5 -     #60     - Pro     - (2) INFORMATION FOR SEQ ID NO: 3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 48 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #3:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #                48CGGT CGAGATCCGC TGCTGCGGGC TCGATCAG     - (2) INFORMATION FOR SEQ ID NO: 4:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 48 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #4:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #                48TTCT CGGTGTTCCG GGCACCCCCG GCATGTTC     - (2) INFORMATION FOR SEQ ID NO: 5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 36 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #5:   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #       36         CCGA GTGGTCGAAG GGGTGG     - (2) INFORMATION FOR SEQ ID NO: 6:     -      (i) SEQUENCE 
CHARACTERISTICS:               (A) LENGTH: 46 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 17..46     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:     - ... ATG ATT ACG CCA AGC TTG CAT GCC             46     Met Thr Met Ile Thr Pro Ser Leu His Ala     10     - (2) INFORMATION FOR SEQ ID NO: 7:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 10 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:     - Met Thr Met Ile Thr Pro Ser Leu His Ala     10     - (2) INFORMATION FOR SEQ ID NO: 8:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 49 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..49     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:     - GAATTTAAGG GGAACATCG ATG ACC ATG ATT ACG CCA AGC TTG CAT GCC       49     Met Thr Met Ile Thr Pro Ser Leu His Ala     10     - (2) INFORMATION FOR SEQ ID NO: 9:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 10 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:     - Met Thr Met Ile Thr Pro Ser Leu His Ala     10     - (2) INFORMATION FOR SEQ ID NO: 10:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 43 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..43     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:     - GAATTTAAGG 
GGAACATCG ATG AGT AAT CAC CAT GGG CAT GCC       43     Met Ser Asn His His Gly His Ala     1               5     - (2) INFORMATION FOR SEQ ID NO: 11:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 8 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:     - Met Ser Asn His His Gly His Ala       1               5     - (2) INFORMATION FOR SEQ ID NO: 12:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 45 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 19..45     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:     - ... AAT CAT CAC CAT GGG CAT GCC               45     Met Ser Asn His His His Gly His Ala     1               5     - (2) INFORMATION FOR SEQ ID NO: 13:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 9 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:     - Met Ser Asn His His His Gly His Ala      1               5     - (2) INFORMATION FOR SEQ ID NO: 14:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 58 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..58     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:     - GAATTTAAGG GGAACATCG ATG AGT AAT ACG CGT AAA CGC AAG CGC CGT ACG       52     Met Ser Asn Thr Arg Lys Arg Lys Arg Arg Thr     10     - ...           58     His Ala     - (2) INFORMATION FOR SEQ ID NO: 15:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 13 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -    
 (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:     - Met Ser Asn Thr Arg Lys Arg Lys Arg Arg Thr His Ala     10     - (2) INFORMATION FOR SEQ ID NO: 16:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 48 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 25..48     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:     - GAATTCACAC AGGAAACAGA ATTC ATG GTT ATG CAC CAT GGG CAT GCC       48     Met Val Met His His Gly His Ala     1               5     - (2) INFORMATION FOR SEQ ID NO: 17:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 8 amino acids               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:     - Met Val Met His His Gly His Ala       1               5     - (2) INFORMATION FOR SEQ ID NO: 18:     -      (i) SEQUENCE CHARACTERISTICS:               (A) LENGTH: 1729 base pairs               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 17..1729     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:     - ... ATG ATT ACG CCA AGC TTG CAT GCC TCG         49     Met Thr Met Ile Thr Pro Ser Leu His Ala Ser     10     - ACC GGG CCG GTC GCG CCG CTT CCG ACG CCG CCG AAC TTC CCG AAC GAC       97     Thr Gly Pro Val Ala Pro Leu Pro Thr Pro Pro Asn Phe Pro Asn Asp     25     - ATC GCG CTG TTC CAG CAG GCG TAC CAG AAC TGG TCC AAG GAG ATC ATG      145     Ile Ala Leu Phe Gln Gln Ala Tyr Gln Asn Trp Ser Lys Glu Ile Met     40     - CTG GAC GCC ACT TGG GTC TGC TCG CCC AAG ACG CCG CAG GAT GTC GTT      193     Leu Asp Ala Thr Trp Val Cys Ser Pro Lys Thr Pro Gln Asp Val Val     
   55     - CGC CTT GCC AAC TGG GCG CAC GAG CAC GAC TA - #C AAG ATC CGC CCG CGC      241     Arg Leu Ala Asn Trp Ala His Glu His Asp Ty - #r Lys Ile Arg Pro Arg     # 75     - GGC GCG ATG CAC GGC TGG ACC CCG CTC ACC GT - #G GAG AAG GGG GCC AAC      289     Gly Ala Met His Gly Trp Thr Pro Leu Thr Va - #l Glu Lys Gly Ala Asn     #                 90     - GTC GAG AAG GTG ATC CTC GCC GAC ACG ATG AC - #G CAT CTG AAC GGC ATC      337     Val Glu Lys Val Ile Leu Ala Asp Thr Met Th - #r His Leu Asn Gly Ile     #            105     - ACG GTG AAC ACG GGC GGC CCC GTG GCT ACC GT - #C ACC GCC GGT GCC GGC      385     Thr Val Asn Thr Gly Gly Pro Val Ala Thr Va - #l Thr Ala Gly Ala Gly     #       120     - GCC AGC ATC GAG GCG ATC GTC ACC GAA CTG CA - #G AAG CAC GAC CTC GGC      433     Ala Ser Ile Glu Ala Ile Val Thr Glu Leu Gl - #n Lys His Asp Leu Gly     #   135     - TGG GCC AAC CTG CCC GCT CCG GGT GTG CTG TC - #G ATC GGT GGC GCC CTT      481     Trp Ala Asn Leu Pro Ala Pro Gly Val Leu Se - #r Ile Gly Gly Ala Leu     140                 1 - #45                 1 - #50                 1 -     #55     - GCG GTC AAC GCG CAC GGT GCG GCG CTG CCG GC - #C GTC GGC CAG ACC ACG      529     Ala Val Asn Ala His Gly Ala Ala Leu Pro Al - #a Val Gly Gln Thr Thr     #               170     - CTG CCC GGT CAC ACC TAC GGT TCG CTG AGC AA - #C CTG GTC ACC GAG CTG      577     Leu Pro Gly His Thr Tyr Gly Ser Leu Ser As - #n Leu Val Thr Glu Leu     #           185     - ACC GCG GTC GTC TGG AAC GGC ACC ACC TAC GC - #A CTC GAG ACG TAC CAG      625     Thr Ala Val Val Trp Asn Gly Thr Thr Tyr Al - #a Leu Glu Thr Tyr Gln     #       200     - CGC AAC GAT CCT CGG ATC ACC CCA CTG CTC AC - #C AAC CTC GGG CGC TGC      673     Arg Asn Asp Pro Arg Ile Thr Pro Leu Leu Th - #r Asn Leu Gly Arg Cys     #   215     - TTC CTG ACC TCG GTG ACG ATG CAG GCC GGC CC - #C AAC TTC CGT CAG CGG      721     Phe Leu Thr Ser Val Thr Met Gln Ala Gly Pr - #o Asn Phe Arg Gln Arg     220                 2 - #25                 2 
- #30                 2 -     #35     - TGC CAG AGC TAC ACC GAC ATC CCG TGG CGG GA - #A CTG TTC GCG CCG AAG      769     Cys Gln Ser Tyr Thr Asp Ile Pro Trp Arg Gl - #u Leu Phe Ala Pro Lys     #               250     - GGC GCC GAC GGC CGC ACG TTC GAG AAG TTC GT - #C GCG GAA TCG GGC GGC      817     Gly Ala Asp Gly Arg Thr Phe Glu Lys Phe Va - #l Ala Glu Ser Gly Gly     #           265     - GCC GAG GCG ATC TGG TAC CCG TTC ACC GAG AA - #G CCG TGG ATG AAG GTG      865     Ala Glu Ala Ile Trp Tyr Pro Phe Thr Glu Ly - #s Pro Trp Met Lys Val     #       280     - TGG ACG GTC TCG CCG ACC AAG CCG GAC TCG TC - #G AAC GAG GTC GGA AGC      913     Trp Thr Val Ser Pro Thr Lys Pro Asp Ser Se - #r Asn Glu Val Gly Ser     #   295     - CTC GGC TCG GCG GGC TCC CTC GTC GGC AAG CC - #T CCG CAG GCG CGT GAG      961     Leu Gly Ser Ala Gly Ser Leu Val Gly Lys Pr - #o Pro Gln Ala Arg Glu     300                 3 - #05                 3 - #10                 3 -     #15     - GTC TCC GGC CCG TAC AAC TAC ATC TTC TCC GA - #C AAC CTG CCG GAG CCC     1009     Val Ser Gly Pro Tyr Asn Tyr Ile Phe Ser As - #p Asn Leu Pro Glu Pro     #               330     - ATC ACC GAC ATG ATC GGC GCC ATC AAC GCC GG - #A AAC CCC GGA ATC GCA     1057     Ile Thr Asp Met Ile Gly Ala Ile Asn Ala Gl - #y Asn Pro Gly Ile Ala     #           345     - CCG CTG TTC GGC CCG GCG ATG TAC GAG ATC AC - #C AAG CTC GGG CTG GCC     1105     Pro Leu Phe Gly Pro Ala Met Tyr Glu Ile Th - #r Lys Leu Gly Leu Ala     #       360     - GCG ACG AAT GCC AAC GAC ATC TGG GGC TGG TC - #G AAG GAC GTC CAG TTC     1153     Ala Thr Asn Ala Asn Asp Ile Trp Gly Trp Se - #r Lys Asp Val Gln Phe     #   375     - TAC ATC AAG GCC ACG ACG TTG CGA CTC ACC GA - #G GGC GGC GGC GCC GTC     1201     Tyr Ile Lys Ala Thr Thr Leu Arg Leu Thr Gl - #u Gly Gly Gly Ala Val     380                 3 - #85                 3 - #90                 3 -     #95     - GTC ACG AGC CGC GCC AAC ATC GCG ACC GTG AT - #C AAC GAC TTC ACC GAG     1249     Val Thr Ser Arg 
Ala Asn Ile Ala Thr Val Il - #e Asn Asp Phe Thr Glu     #               410     - TGG TTC CAC GAG CGC ATC GAG TTC TAC CGC GC - #G AAG GGC GAG TTC CCG     1297     Trp Phe His Glu Arg Ile Glu Phe Tyr Arg Al - #a Lys Gly Glu Phe Pro     #           425     - CTC AAC GGT CCG GTC GAG ATC CGC TGC TGC GG - #G CTC GAT CAG GCA GCC     1345     Leu Asn Gly Pro Val Glu Ile Arg Cys Cys Gl - #y Leu Asp Gln Ala Ala     #       440     - GAC GTC AAG GTG CCG TCG GTG GGC CCG CCG AC - #C ATC TCG GCG ACC CGT     1393     Asp Val Lys Val Pro Ser Val Gly Pro Pro Th - #r Ile Ser Ala Thr Arg     #   455     - CCG CGT CCG GAT CAT CCG GAC TGG GAC GTC GC - #G ATC TGG CTG AAC GTT     1441     Pro Arg Pro Asp His Pro Asp Trp Asp Val Al - #a Ile Trp Leu Asn Val     460                 4 - #65                 4 - #70                 4 -     #75     - CTC GGT GTT CCG GGC ACC CCC GGC ATG TTC GA - #G TTC TAC CGC GAG ATG     1489     Leu Gly Val Pro Gly Thr Pro Gly Met Phe Gl - #u Phe Tyr Arg Glu Met     #               490     - GAG CAG TGG ATG CGG AGC CAC TAC AAC AAC GA - #C GAC GCC ACC TTC CGG     1537     Glu Gln Trp Met Arg Ser His Tyr Asn Asn As - #p Asp Ala Thr Phe Arg     #           505     - CCC GAG TGG TCG AAG GGG TGG GCG TTC GGT CC - #C GAC CCG TAC ACC GAC     1585     Pro Glu Trp Ser Lys Gly Trp Ala Phe Gly Pr - #o Asp Pro Tyr Thr Asp     #       520     - AAC GAC ATC GTC ACG AAC AAG ATG CGC GCC AC - #C TAC ATC GAA GGT GTC     1633     Asn Asp Ile Val Thr Asn Lys Met Arg Ala Th - #r Tyr Ile Glu Gly Val     #   535     - CCG ACG ACC GAG AAC TGG GAC ACC GCG CGC GC - #T CGG TAC AAC CAG ATC     1681     Pro Thr Thr Glu Asn Trp Asp Thr Ala Arg Al - #a Arg Tyr Asn Gln Ile     540                 5 - #45                 5 - #50                 5 -     #55     - GAC CCG CAT CGC GTG TTC ACC AAC GGA TTC AT - #G GAC AAG CTG CTT CCG     1729     Asp Pro His Arg Val Phe Thr Asn Gly Phe Me - #t Asp Lys Leu Leu Pro     #               570     - (2) INFORMATION FOR SEQ ID NO: 19:     -      (i) 
SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 571 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #19:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Thr Met Ile Thr Pro Ser Leu His Ala Se - #r Thr Gly Pro Val Ala     #                 15     - Pro Leu Pro Thr Pro Pro Asn Phe Pro Asn As - #p Ile Ala Leu Phe Gln     #             30     - Gln Ala Tyr Gln Asn Trp Ser Lys Glu Ile Me - #t Leu Asp Ala Thr Trp     #         45     - Val Cys Ser Pro Lys Thr Pro Gln Asp Val Va - #l Arg Leu Ala Asn Trp     #     60     - Ala His Glu His Asp Tyr Lys Ile Arg Pro Ar - #g Gly Ala Met His Gly     # 80     - Trp Thr Pro Leu Thr Val Glu Lys Gly Ala As - #n Val Glu Lys Val Ile     #                 95     - Leu Ala Asp Thr Met Thr His Leu Asn Gly Il - #e Thr Val Asn Thr Gly     #           110     - Gly Pro Val Ala Thr Val Thr Ala Gly Ala Gl - #y Ala Ser Ile Glu Ala     #       125     - Ile Val Thr Glu Leu Gln Lys His Asp Leu Gl - #y Trp Ala Asn Leu Pro     #   140     - Ala Pro Gly Val Leu Ser Ile Gly Gly Ala Le - #u Ala Val Asn Ala His     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Ala Ala Leu Pro Ala Val Gly Gln Thr Th - #r Leu Pro Gly His Thr     #               175     - Tyr Gly Ser Leu Ser Asn Leu Val Thr Glu Le - #u Thr Ala Val Val Trp     #           190     - Asn Gly Thr Thr Tyr Ala Leu Glu Thr Tyr Gl - #n Arg Asn Asp Pro Arg     #       205     - Ile Thr Pro Leu Leu Thr Asn Leu Gly Arg Cy - #s Phe Leu Thr Ser Val     #   220     - Thr Met Gln Ala Gly Pro Asn Phe Arg Gln Ar - #g Cys Gln Ser Tyr Thr     225                 2 - #30                 2 - #35                 2 -     #40     - Asp Ile Pro Trp Arg Glu Leu Phe Ala Pro Ly - #s Gly Ala Asp Gly Arg     #               255     - Thr Phe Glu Lys Phe Val Ala Glu Ser Gly Gl - #y Ala Glu Ala Ile Trp     #           270     - Tyr Pro Phe Thr Glu Lys Pro Trp Met Lys Va - #l Trp Thr Val Ser Pro     #   
    285     - Thr Lys Pro Asp Ser Ser Asn Glu Val Gly Se - #r Leu Gly Ser Ala Gly     #   300     - Ser Leu Val Gly Lys Pro Pro Gln Ala Arg Gl - #u Val Ser Gly Pro Tyr     305                 3 - #10                 3 - #15                 3 -     #20     - Asn Tyr Ile Phe Ser Asp Asn Leu Pro Glu Pr - #o Ile Thr Asp Met Ile     #               335     - Gly Ala Ile Asn Ala Gly Asn Pro Gly Ile Al - #a Pro Leu Phe Gly Pro     #           350     - Ala Met Tyr Glu Ile Thr Lys Leu Gly Leu Al - #a Ala Thr Asn Ala Asn     #       365     - Asp Ile Trp Gly Trp Ser Lys Asp Val Gln Ph - #e Tyr Ile Lys Ala Thr     #   380     - Thr Leu Arg Leu Thr Glu Gly Gly Gly Ala Va - #l Val Thr Ser Arg Ala     385                 3 - #90                 3 - #95                 4 -     #00     - Asn Ile Ala Thr Val Ile Asn Asp Phe Thr Gl - #u Trp Phe His Glu Arg     #               415     - Ile Glu Phe Tyr Arg Ala Lys Gly Glu Phe Pr - #o Leu Asn Gly Pro Val     #           430     - Glu Ile Arg Cys Cys Gly Leu Asp Gln Ala Al - #a Asp Val Lys Val Pro     #       445     - Ser Val Gly Pro Pro Thr Ile Ser Ala Thr Ar - #g Pro Arg Pro Asp His     #   460     - Pro Asp Trp Asp Val Ala Ile Trp Leu Asn Va - #l Leu Gly Val Pro Gly     465                 4 - #70                 4 - #75                 4 -     #80     - Thr Pro Gly Met Phe Glu Phe Tyr Arg Glu Me - #t Glu Gln Trp Met Arg     #               495     - Ser His Tyr Asn Asn Asp Asp Ala Thr Phe Ar - #g Pro Glu Trp Ser Lys     #           510     - Gly Trp Ala Phe Gly Pro Asp Pro Tyr Thr As - #p Asn Asp Ile Val Thr     #       525     - Asn Lys Met Arg Ala Thr Tyr Ile Glu Gly Va - #l Pro Thr Thr Glu Asn     #   540     - Trp Asp Thr Ala Arg Ala Arg Tyr Asn Gln Il - #e Asp Pro His Arg Val     545                 5 - #50                 5 - #55                 5 -     #60     - Phe Thr Asn Gly Phe Met Asp Lys Leu Leu Pr - #o     #               570     - (2) INFORMATION FOR SEQ ID NO: 20:     -      (i) SEQUENCE CHARACTERISTICS:     
#pairs    (A) LENGTH: 1732 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..1732     #20:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - GAATTTAAGG GGAACATCG ATG ACC ATG ATT ACG CCA AGC - # TTG CAT GCC TCG       52     Met Thr Met Ile Thr Pro Ser Leu His Ala Se - #r     #               10     - ACC GGG CCG GTC GCG CCG CTT CCG ACG CCG CC - #G AAC TTC CCG AAC GAC      100     Thr Gly Pro Val Ala Pro Leu Pro Thr Pro Pr - #o Asn Phe Pro Asn Asp     #             25     - ATC GCG CTG TTC CAG CAG GCG TAC CAG AAC TG - #G TCC AAG GAG ATC ATG      148     Ile Ala Leu Phe Gln Gln Ala Tyr Gln Asn Tr - #p Ser Lys Glu Ile Met     #         40     - CTG GAC GCC ACT TGG GTC TGC TCG CCC AAG AC - #G CCG CAG GAT GTC GTT      196     Leu Asp Ala Thr Trp Val Cys Ser Pro Lys Th - #r Pro Gln Asp Val Val     #     55     - CGC CTT GCC AAC TGG GCG CAC GAG CAC GAC TA - #C AAG ATC CGC CCG CGC      244     Arg Leu Ala Asn Trp Ala His Glu His Asp Ty - #r Lys Ile Arg Pro Arg     # 75     - GGC GCG ATG CAC GGC TGG ACC CCG CTC ACC GT - #G GAG AAG GGG GCC AAC      292     Gly Ala Met His Gly Trp Thr Pro Leu Thr Va - #l Glu Lys Gly Ala Asn     #                 90     - GTC GAG AAG GTG ATC CTC GCC GAC ACG ATG AC - #G CAT CTG AAC GGC ATC      340     Val Glu Lys Val Ile Leu Ala Asp Thr Met Th - #r His Leu Asn Gly Ile     #            105     - ACG GTG AAC ACG GGC GGC CCC GTG GCT ACC GT - #C ACC GCC GGT GCC GGC      388     Thr Val Asn Thr Gly Gly Pro Val Ala Thr Va - #l Thr Ala Gly Ala Gly     #       120     - GCC AGC ATC GAG GCG ATC GTC ACC GAA CTG CA - #G AAG CAC GAC CTC GGC      436     Ala Ser Ile Glu Ala Ile Val Thr Glu Leu Gl - #n Lys His Asp Leu Gly     #   135     - TGG GCC AAC CTG CCC GCT CCG GGT GTG CTG TC - #G ATC GGT GGC GCC CTT      484     Trp Ala Asn Leu Pro Ala Pro Gly Val Leu Se - #r Ile Gly Gly Ala Leu     140                 1 - 
#45                 1 - #50                 1 -     #55     - GCG GTC AAC GCG CAC GGT GCG GCG CTG CCG GC - #C GTC GGC CAG ACC ACG      532     Ala Val Asn Ala His Gly Ala Ala Leu Pro Al - #a Val Gly Gln Thr Thr     #               170     - CTG CCC GGT CAC ACC TAC GGT TCG CTG AGC AA - #C CTG GTC ACC GAG CTG      580     Leu Pro Gly His Thr Tyr Gly Ser Leu Ser As - #n Leu Val Thr Glu Leu     #           185     - ACC GCG GTC GTC TGG AAC GGC ACC ACC TAC GC - #A CTC GAG ACG TAC CAG      628     Thr Ala Val Val Trp Asn Gly Thr Thr Tyr Al - #a Leu Glu Thr Tyr Gln     #       200     - CGC AAC GAT CCT CGG ATC ACC CCA CTG CTC AC - #C AAC CTC GGG CGC TGC      676     Arg Asn Asp Pro Arg Ile Thr Pro Leu Leu Th - #r Asn Leu Gly Arg Cys     #   215     - TTC CTG ACC TCG GTG ACG ATG CAG GCC GGC CC - #C AAC TTC CGT CAG CGG      724     Phe Leu Thr Ser Val Thr Met Gln Ala Gly Pr - #o Asn Phe Arg Gln Arg     220                 2 - #25                 2 - #30                 2 -     #35     - TGC CAG AGC TAC ACC GAC ATC CCG TGG CGG GA - #A CTG TTC GCG CCG AAG      772     Cys Gln Ser Tyr Thr Asp Ile Pro Trp Arg Gl - #u Leu Phe Ala Pro Lys     #               250     - GGC GCC GAC GGC CGC ACG TTC GAG AAG TTC GT - #C GCG GAA TCG GGC GGC      820     Gly Ala Asp Gly Arg Thr Phe Glu Lys Phe Va - #l Ala Glu Ser Gly Gly     #           265     - GCC GAG GCG ATC TGG TAC CCG TTC ACC GAG AA - #G CCG TGG ATG AAG GTG      868     Ala Glu Ala Ile Trp Tyr Pro Phe Thr Glu Ly - #s Pro Trp Met Lys Val     #       280     - TGG ACG GTC TCG CCG ACC AAG CCG GAC TCG TC - #G AAC GAG GTC GGA AGC      916     Trp Thr Val Ser Pro Thr Lys Pro Asp Ser Se - #r Asn Glu Val Gly Ser     #   295     - CTC GGC TCG GCG GGC TCC CTC GTC GGC AAG CC - #T CCG CAG GCG CGT GAG      964     Leu Gly Ser Ala Gly Ser Leu Val Gly Lys Pr - #o Pro Gln Ala Arg Glu     300                 3 - #05                 3 - #10                 3 -     #15     - GTC TCC GGC CCG TAC AAC TAC ATC TTC TCC GA - #C AAC CTG CCG GAG CCC     
1012     Val Ser Gly Pro Tyr Asn Tyr Ile Phe Ser As - #p Asn Leu Pro Glu Pro     #               330     - ATC ACC GAC ATG ATC GGC GCC ATC AAC GCC GG - #A AAC CCC GGA ATC GCA     1060     Ile Thr Asp Met Ile Gly Ala Ile Asn Ala Gl - #y Asn Pro Gly Ile Ala     #           345     - CCG CTG TTC GGC CCG GCG ATG TAC GAG ATC AC - #C AAG CTC GGG CTG GCC     1108     Pro Leu Phe Gly Pro Ala Met Tyr Glu Ile Th - #r Lys Leu Gly Leu Ala     #       360     - GCG ACG AAT GCC AAC GAC ATC TGG GGC TGG TC - #G AAG GAC GTC CAG TTC     1156     Ala Thr Asn Ala Asn Asp Ile Trp Gly Trp Se - #r Lys Asp Val Gln Phe     #   375     - TAC ATC AAG GCC ACG ACG TTG CGA CTC ACC GA - #G GGC GGC GGC GCC GTC     1204     Tyr Ile Lys Ala Thr Thr Leu Arg Leu Thr Gl - #u Gly Gly Gly Ala Val     380                 3 - #85                 3 - #90                 3 -     #95     - GTC ACG AGC CGC GCC AAC ATC GCG ACC GTG AT - #C AAC GAC TTC ACC GAG     1252     Val Thr Ser Arg Ala Asn Ile Ala Thr Val Il - #e Asn Asp Phe Thr Glu     #               410     - TGG TTC CAC GAG CGC ATC GAG TTC TAC CGC GC - #G AAG GGC GAG TTC CCG     1300     Trp Phe His Glu Arg Ile Glu Phe Tyr Arg Al - #a Lys Gly Glu Phe Pro     #           425     - CTC AAC GGT CCG GTC GAG ATC CGC TGC TGC GG - #G CTC GAT CAG GCA GCC     1348     Leu Asn Gly Pro Val Glu Ile Arg Cys Cys Gl - #y Leu Asp Gln Ala Ala     #       440     - GAC GTC AAG GTG CCG TCG GTG GGC CCG CCG AC - #C ATC TCG GCG ACC CGT     1396     Asp Val Lys Val Pro Ser Val Gly Pro Pro Th - #r Ile Ser Ala Thr Arg     #   455     - CCG CGT CCG GAT CAT CCG GAC TGG GAC GTC GC - #G ATC TGG CTG AAC GTT     1444     Pro Arg Pro Asp His Pro Asp Trp Asp Val Al - #a Ile Trp Leu Asn Val     460                 4 - #65                 4 - #70                 4 -     #75     - CTC GGT GTT CCG GGC ACC CCC GGC ATG TTC GA - #G TTC TAC CGC GAG ATG     1492     Leu Gly Val Pro Gly Thr Pro Gly Met Phe Gl - #u Phe Tyr Arg Glu Met     #               490     - GAG CAG TGG ATG CGG AGC CAC 
TAC AAC AAC GA - #C GAC GCC ACC TTC CGG     1540     Glu Gln Trp Met Arg Ser His Tyr Asn Asn As - #p Asp Ala Thr Phe Arg     #           505     - CCC GAG TGG TCG AAG GGG TGG GCG TTC GGT CC - #C GAC CCG TAC ACC GAC     1588     Pro Glu Trp Ser Lys Gly Trp Ala Phe Gly Pr - #o Asp Pro Tyr Thr Asp     #       520     - AAC GAC ATC GTC ACG AAC AAG ATG CGC GCC AC - #C TAC ATC GAA GGT GTC     1636     Asn Asp Ile Val Thr Asn Lys Met Arg Ala Th - #r Tyr Ile Glu Gly Val     #   535     - CCG ACG ACC GAG AAC TGG GAC ACC GCG CGC GC - #T CGG TAC AAC CAG ATC     1684     Pro Thr Thr Glu Asn Trp Asp Thr Ala Arg Al - #a Arg Tyr Asn Gln Ile     540                 5 - #45                 5 - #50                 5 -     #55     - GAC CCG CAT CGC GTG TTC ACC AAC GGA TTC AT - #G GAC AAG CTG CTT CCG     1732     Asp Pro His Arg Val Phe Thr Asn Gly Phe Me - #t Asp Lys Leu Leu Pro     #               570     - (2) INFORMATION FOR SEQ ID NO: 21:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 571 amino               (B) TYPE: amino acids               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #21:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Thr Met Ile Thr Pro Ser Leu His Ala Se - #r Thr Gly Pro Val Ala     #                 15     - Pro Leu Pro Thr Pro Pro Asn Phe Pro Asn As - #p Ile Ala Leu Phe Gln     #             30     - Gln Ala Tyr Gln Asn Trp Ser Lys Glu Ile Me - #t Leu Asp Ala Thr Trp     #         45     - Val Cys Ser Pro Lys Thr Pro Gln Asp Val Va - #l Arg Leu Ala Asn Trp     #     60     - Ala His Glu His Asp Tyr Lys Ile Arg Pro Ar - #g Gly Ala Met His Gly     # 80     - Trp Thr Pro Leu Thr Val Glu Lys Gly Ala As - #n Val Glu Lys Val Ile     #                 95     - Leu Ala Asp Thr Met Thr His Leu Asn Gly Il - #e Thr Val Asn Thr Gly     #           110     - Gly Pro Val Ala Thr Val Thr Ala Gly Ala Gl - #y Ala Ser Ile Glu Ala     #       125     - Ile Val Thr Glu Leu Gln Lys His Asp Leu Gl - #y Trp Ala Asn Leu Pro     #   140     
- Ala Pro Gly Val Leu Ser Ile Gly Gly Ala Le - #u Ala Val Asn Ala His     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Ala Ala Leu Pro Ala Val Gly Gln Thr Th - #r Leu Pro Gly His Thr     #               175     - Tyr Gly Ser Leu Ser Asn Leu Val Thr Glu Le - #u Thr Ala Val Val Trp     #           190     - Asn Gly Thr Thr Tyr Ala Leu Glu Thr Tyr Gl - #n Arg Asn Asp Pro Arg     #       205     - Ile Thr Pro Leu Leu Thr Asn Leu Gly Arg Cy - #s Phe Leu Thr Ser Val     #   220     - Thr Met Gln Ala Gly Pro Asn Phe Arg Gln Ar - #g Cys Gln Ser Tyr Thr     225                 2 - #30                 2 - #35                 2 -     #40     - Asp Ile Pro Trp Arg Glu Leu Phe Ala Pro Ly - #s Gly Ala Asp Gly Arg     #               255     - Thr Phe Glu Lys Phe Val Ala Glu Ser Gly Gl - #y Ala Glu Ala Ile Trp     #           270     - Tyr Pro Phe Thr Glu Lys Pro Trp Met Lys Va - #l Trp Thr Val Ser Pro     #       285     - Thr Lys Pro Asp Ser Ser Asn Glu Val Gly Se - #r Leu Gly Ser Ala Gly     #   300     - Ser Leu Val Gly Lys Pro Pro Gln Ala Arg Gl - #u Val Ser Gly Pro Tyr     305                 3 - #10                 3 - #15                 3 -     #20     - Asn Tyr Ile Phe Ser Asp Asn Leu Pro Glu Pr - #o Ile Thr Asp Met Ile     #               335     - Gly Ala Ile Asn Ala Gly Asn Pro Gly Ile Al - #a Pro Leu Phe Gly Pro     #           350     - Ala Met Tyr Glu Ile Thr Lys Leu Gly Leu Al - #a Ala Thr Asn Ala Asn     #       365     - Asp Ile Trp Gly Trp Ser Lys Asp Val Gln Ph - #e Tyr Ile Lys Ala Thr     #   380     - Thr Leu Arg Leu Thr Glu Gly Gly Gly Ala Va - #l Val Thr Ser Arg Ala     385                 3 - #90                 3 - #95                 4 -     #00     - Asn Ile Ala Thr Val Ile Asn Asp Phe Thr Gl - #u Trp Phe His Glu Arg     #               415     - Ile Glu Phe Tyr Arg Ala Lys Gly Glu Phe Pr - #o Leu Asn Gly Pro Val     #           430     - Glu Ile Arg Cys Cys Gly Leu Asp Gln Ala Al - #a Asp Val Lys Val Pro   
  #       445     - Ser Val Gly Pro Pro Thr Ile Ser Ala Thr Ar - #g Pro Arg Pro Asp His     #   460     - Pro Asp Trp Asp Val Ala Ile Trp Leu Asn Va - #l Leu Gly Val Pro Gly     465                 4 - #70                 4 - #75                 4 -     #80     - Thr Pro Gly Met Phe Glu Phe Tyr Arg Glu Me - #t Glu Gln Trp Met Arg     #               495     - Ser His Tyr Asn Asn Asp Asp Ala Thr Phe Ar - #g Pro Glu Trp Ser Lys     #           510     - Gly Trp Ala Phe Gly Pro Asp Pro Tyr Thr As - #p Asn Asp Ile Val Thr     #       525     - Asn Lys Met Arg Ala Thr Tyr Ile Glu Gly Va - #l Pro Thr Thr Glu Asn     #   540     - Trp Asp Thr Ala Arg Ala Arg Tyr Asn Gln Il - #e Asp Pro His Arg Val     545                 5 - #50                 5 - #55                 5 -     #60     - Phe Thr Asn Gly Phe Met Asp Lys Leu Leu Pr - #o     #               570     - (2) INFORMATION FOR SEQ ID NO: 22:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1726 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..1726     #22:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - GAATTTAAGG GGAACATCG ATG AGT AAT CAC CAT GGG CAT - # GCC TCG ACC GGG       52     Met Ser Asn His His Gly His Ala Ser Thr Gl - #y     #               10     - CCG GTC GCG CCG CTT CCG ACG CCG CCG AAC TT - #C CCG AAC GAC ATC GCG      100     Pro Val Ala Pro Leu Pro Thr Pro Pro Asn Ph - #e Pro Asn Asp Ile Ala     #             25     - CTG TTC CAG CAG GCG TAC CAG AAC TGG TCC AA - #G GAG ATC ATG CTG GAC      148     Leu Phe Gln Gln Ala Tyr Gln Asn Trp Ser Ly - #s Glu Ile Met Leu Asp     #         40     - GCC ACT TGG GTC TGC TCG CCC AAG ACG CCG CA - #G GAT GTC GTT CGC CTT      196     Ala Thr Trp Val Cys Ser Pro Lys Thr Pro Gl - #n Asp Val Val Arg Leu     #     55     - GCC AAC TGG GCG CAC GAG CAC GAC TAC AAG AT - #C CGC CCG CGC GGC GCG      244     Ala 
Asn Trp Ala His Glu His Asp Tyr Lys Il - #e Arg Pro Arg Gly Ala     # 75     - ATG CAC GGC TGG ACC CCG CTC ACC GTG GAG AA - #G GGG GCC AAC GTC GAG      292     Met His Gly Trp Thr Pro Leu Thr Val Glu Ly - #s Gly Ala Asn Val Glu     #                 90     - AAG GTG ATC CTC GCC GAC ACG ATG ACG CAT CT - #G AAC GGC ATC ACG GTG      340     Lys Val Ile Leu Ala Asp Thr Met Thr His Le - #u Asn Gly Ile Thr Val     #            105     - AAC ACG GGC GGC CCC GTG GCT ACC GTC ACC GC - #C GGT GCC GGC GCC AGC      388     Asn Thr Gly Gly Pro Val Ala Thr Val Thr Al - #a Gly Ala Gly Ala Ser     #       120     - ATC GAG GCG ATC GTC ACC GAA CTG CAG AAG CA - #C GAC CTC GGC TGG GCC      436     Ile Glu Ala Ile Val Thr Glu Leu Gln Lys Hi - #s Asp Leu Gly Trp Ala     #   135     - AAC CTG CCC GCT CCG GGT GTG CTG TCG ATC GG - #T GGC GCC CTT GCG GTC      484     Asn Leu Pro Ala Pro Gly Val Leu Ser Ile Gl - #y Gly Ala Leu Ala Val     140                 1 - #45                 1 - #50                 1 -     #55     - AAC GCG CAC GGT GCG GCG CTG CCG GCC GTC GG - #C CAG ACC ACG CTG CCC      532     Asn Ala His Gly Ala Ala Leu Pro Ala Val Gl - #y Gln Thr Thr Leu Pro     #               170     - GGT CAC ACC TAC GGT TCG CTG AGC AAC CTG GT - #C ACC GAG CTG ACC GCG      580     Gly His Thr Tyr Gly Ser Leu Ser Asn Leu Va - #l Thr Glu Leu Thr Ala     #           185     - GTC GTC TGG AAC GGC ACC ACC TAC GCA CTC GA - #G ACG TAC CAG CGC AAC      628     Val Val Trp Asn Gly Thr Thr Tyr Ala Leu Gl - #u Thr Tyr Gln Arg Asn     #       200     - GAT CCT CGG ATC ACC CCA CTG CTC ACC AAC CT - #C GGG CGC TGC TTC CTG      676     Asp Pro Arg Ile Thr Pro Leu Leu Thr Asn Le - #u Gly Arg Cys Phe Leu     #   215     - ACC TCG GTG ACG ATG CAG GCC GGC CCC AAC TT - #C CGT CAG CGG TGC CAG      724     Thr Ser Val Thr Met Gln Ala Gly Pro Asn Ph - #e Arg Gln Arg Cys Gln     220                 2 - #25                 2 - #30                 2 -     #35     - AGC TAC ACC GAC ATC CCG TGG CGG GAA CTG TT - #C GCG CCG 
AAG GGC GCC      772     Ser Tyr Thr Asp Ile Pro Trp Arg Glu Leu Ph - #e Ala Pro Lys Gly Ala     #               250     - GAC GGC CGC ACG TTC GAG AAG TTC GTC GCG GA - #A TCG GGC GGC GCC GAG      820     Asp Gly Arg Thr Phe Glu Lys Phe Val Ala Gl - #u Ser Gly Gly Ala Glu     #           265     - GCG ATC TGG TAC CCG TTC ACC GAG AAG CCG TG - #G ATG AAG GTG TGG ACG      868     Ala Ile Trp Tyr Pro Phe Thr Glu Lys Pro Tr - #p Met Lys Val Trp Thr     #       280     - GTC TCG CCG ACC AAG CCG GAC TCG TCG AAC GA - #G GTC GGA AGC CTC GGC      916     Val Ser Pro Thr Lys Pro Asp Ser Ser Asn Gl - #u Val Gly Ser Leu Gly     #   295     - TCG GCG GGC TCC CTC GTC GGC AAG CCT CCG CA - #G GCG CGT GAG GTC TCC      964     Ser Ala Gly Ser Leu Val Gly Lys Pro Pro Gl - #n Ala Arg Glu Val Ser     300                 3 - #05                 3 - #10                 3 -     #15     - GGC CCG TAC AAC TAC ATC TTC TCC GAC AAC CT - #G CCG GAG CCC ATC ACC     1012     Gly Pro Tyr Asn Tyr Ile Phe Ser Asp Asn Le - #u Pro Glu Pro Ile Thr     #               330     - GAC ATG ATC GGC GCC ATC AAC GCC GGA AAC CC - #C GGA ATC GCA CCG CTG     1060     Asp Met Ile Gly Ala Ile Asn Ala Gly Asn Pr - #o Gly Ile Ala Pro Leu     #           345     - TTC GGC CCG GCG ATG TAC GAG ATC ACC AAG CT - #C GGG CTG GCC GCG ACG     1108     Phe Gly Pro Ala Met Tyr Glu Ile Thr Lys Le - #u Gly Leu Ala Ala Thr     #       360     - AAT GCC AAC GAC ATC TGG GGC TGG TCG AAG GA - #C GTC CAG TTC TAC ATC     1156     Asn Ala Asn Asp Ile Trp Gly Trp Ser Lys As - #p Val Gln Phe Tyr Ile     #   375     - AAG GCC ACG ACG TTG CGA CTC ACC GAG GGC GG - #C GGC GCC GTC GTC ACG     1204     Lys Ala Thr Thr Leu Arg Leu Thr Glu Gly Gl - #y Gly Ala Val Val Thr     380                 3 - #85                 3 - #90                 3 -     #95     - AGC CGC GCC AAC ATC GCG ACC GTG ATC AAC GA - #C TTC ACC GAG TGG TTC     1252     Ser Arg Ala Asn Ile Ala Thr Val Ile Asn As - #p Phe Thr Glu Trp Phe     #               410     - CAC GAG CGC 
ATC GAG TTC TAC CGC GCG AAG GG - #C GAG TTC CCG CTC AAC     1300     His Glu Arg Ile Glu Phe Tyr Arg Ala Lys Gl - #y Glu Phe Pro Leu Asn     #           425     - GGT CCG GTC GAG ATC CGC TGC TGC GGG CTC GA - #T CAG GCA GCC GAC GTC     1348     Gly Pro Val Glu Ile Arg Cys Cys Gly Leu As - #p Gln Ala Ala Asp Val     #       440     - AAG GTG CCG TCG GTG GGC CCG CCG ACC ATC TC - #G GCG ACC CGT CCG CGT     1396     Lys Val Pro Ser Val Gly Pro Pro Thr Ile Se - #r Ala Thr Arg Pro Arg     #   455     - CCG GAT CAT CCG GAC TGG GAC GTC GCG ATC TG - #G CTG AAC GTT CTC GGT     1444     Pro Asp His Pro Asp Trp Asp Val Ala Ile Tr - #p Leu Asn Val Leu Gly     460                 4 - #65                 4 - #70                 4 -     #75     - GTT CCG GGC ACC CCC GGC ATG TTC GAG TTC TA - #C CGC GAG ATG GAG CAG     1492     Val Pro Gly Thr Pro Gly Met Phe Glu Phe Ty - #r Arg Glu Met Glu Gln     #               490     - TGG ATG CGG AGC CAC TAC AAC AAC GAC GAC GC - #C ACC TTC CGG CCC GAG     1540     Trp Met Arg Ser His Tyr Asn Asn Asp Asp Al - #a Thr Phe Arg Pro Glu     #           505     - TGG TCG AAG GGG TGG GCG TTC GGT CCC GAC CC - #G TAC ACC GAC AAC GAC     1588     Trp Ser Lys Gly Trp Ala Phe Gly Pro Asp Pr - #o Tyr Thr Asp Asn Asp     #       520     - ATC GTC ACG AAC AAG ATG CGC GCC ACC TAC AT - #C GAA GGT GTC CCG ACG     1636     Ile Val Thr Asn Lys Met Arg Ala Thr Tyr Il - #e Glu Gly Val Pro Thr     #   535     - ACC GAG AAC TGG GAC ACC GCG CGC GCT CGG TA - #C AAC CAG ATC GAC CCG     1684     Thr Glu Asn Trp Asp Thr Ala Arg Ala Arg Ty - #r Asn Gln Ile Asp Pro     540                 5 - #45                 5 - #50                 5 -     #55     - CAT CGC GTG TTC ACC AAC GGA TTC ATG GAC AA - #G CTG CTT CCG     #1726     His Arg Val Phe Thr Asn Gly Phe Met Asp Ly - #s Leu Leu Pro     #               565     - (2) INFORMATION FOR SEQ ID NO: 23:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 569 amino               (B) TYPE: amino acid               (D) 
TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #23:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Ser Asn His His Gly His Ala Ser Thr Gl - #y Pro Val Ala Pro Leu     #                 15     - Pro Thr Pro Pro Asn Phe Pro Asn Asp Ile Al - #a Leu Phe Gln Gln Ala     #             30     - Tyr Gln Asn Trp Ser Lys Glu Ile Met Leu As - #p Ala Thr Trp Val Cys     #         45     - Ser Pro Lys Thr Pro Gln Asp Val Val Arg Le - #u Ala Asn Trp Ala His     #     60     - Glu His Asp Tyr Lys Ile Arg Pro Arg Gly Al - #a Met His Gly Trp Thr     # 80     - Pro Leu Thr Val Glu Lys Gly Ala Asn Val Gl - #u Lys Val Ile Leu Ala     #                 95     - Asp Thr Met Thr His Leu Asn Gly Ile Thr Va - #l Asn Thr Gly Gly Pro     #           110     - Val Ala Thr Val Thr Ala Gly Ala Gly Ala Se - #r Ile Glu Ala Ile Val     #       125     - Thr Glu Leu Gln Lys His Asp Leu Gly Trp Al - #a Asn Leu Pro Ala Pro     #   140     - Gly Val Leu Ser Ile Gly Gly Ala Leu Ala Va - #l Asn Ala His Gly Ala     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Leu Pro Ala Val Gly Gln Thr Thr Leu Pr - #o Gly His Thr Tyr Gly     #               175     - Ser Leu Ser Asn Leu Val Thr Glu Leu Thr Al - #a Val Val Trp Asn Gly     #           190     - Thr Thr Tyr Ala Leu Glu Thr Tyr Gln Arg As - #n Asp Pro Arg Ile Thr     #       205     - Pro Leu Leu Thr Asn Leu Gly Arg Cys Phe Le - #u Thr Ser Val Thr Met     #   220     - Gln Ala Gly Pro Asn Phe Arg Gln Arg Cys Gl - #n Ser Tyr Thr Asp Ile     225                 2 - #30                 2 - #35                 2 -     #40     - Pro Trp Arg Glu Leu Phe Ala Pro Lys Gly Al - #a Asp Gly Arg Thr Phe     #               255     - Glu Lys Phe Val Ala Glu Ser Gly Gly Ala Gl - #u Ala Ile Trp Tyr Pro     #           270     - Phe Thr Glu Lys Pro Trp Met Lys Val Trp Th - #r Val Ser Pro Thr Lys     #       285     - Pro Asp Ser Ser Asn Glu Val Gly Ser Leu Gl - #y Ser Ala Gly Ser Leu     #   300     - Val Gly Lys 
Pro Pro Gln Ala Arg Glu Val Se - #r Gly Pro Tyr Asn Tyr     305                 3 - #10                 3 - #15                 3 -     #20     - Ile Phe Ser Asp Asn Leu Pro Glu Pro Ile Th - #r Asp Met Ile Gly Ala     #               335     - Ile Asn Ala Gly Asn Pro Gly Ile Ala Pro Le - #u Phe Gly Pro Ala Met     #           350     - Tyr Glu Ile Thr Lys Leu Gly Leu Ala Ala Th - #r Asn Ala Asn Asp Ile     #       365     - Trp Gly Trp Ser Lys Asp Val Gln Phe Tyr Il - #e Lys Ala Thr Thr Leu     #   380     - Arg Leu Thr Glu Gly Gly Gly Ala Val Val Th - #r Ser Arg Ala Asn Ile     385                 3 - #90                 3 - #95                 4 -     #00     - Ala Thr Val Ile Asn Asp Phe Thr Glu Trp Ph - #e His Glu Arg Ile Glu     #               415     - Phe Tyr Arg Ala Lys Gly Glu Phe Pro Leu As - #n Gly Pro Val Glu Ile     #           430     - Arg Cys Cys Gly Leu Asp Gln Ala Ala Asp Va - #l Lys Val Pro Ser Val     #       445     - Gly Pro Pro Thr Ile Ser Ala Thr Arg Pro Ar - #g Pro Asp His Pro Asp     #   460     - Trp Asp Val Ala Ile Trp Leu Asn Val Leu Gl - #y Val Pro Gly Thr Pro     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Met Phe Glu Phe Tyr Arg Glu Met Glu Gl - #n Trp Met Arg Ser His     #               495     - Tyr Asn Asn Asp Asp Ala Thr Phe Arg Pro Gl - #u Trp Ser Lys Gly Trp     #           510     - Ala Phe Gly Pro Asp Pro Tyr Thr Asp Asn As - #p Ile Val Thr Asn Lys     #       525     - Met Arg Ala Thr Tyr Ile Glu Gly Val Pro Th - #r Thr Glu Asn Trp Asp     #   540     - Thr Ala Arg Ala Arg Tyr Asn Gln Ile Asp Pr - #o His Arg Val Phe Thr     545                 5 - #50                 5 - #55                 5 -     #60     - Asn Gly Phe Met Asp Lys Leu Leu Pro                     565     - (2) INFORMATION FOR SEQ ID NO: 24:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1728 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) 
TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 19..1728     #24:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #CAT GCC TCG ACC       51 AGT AAT CAT CAC CAT GGG     Met Ser Asn His His His Gly His Ala Ser Th - #r     #               10     - GGG CCG GTC GCG CCG CTT CCG ACG CCG CCG AA - #C TTC CCG AAC GAC ATC       99     Gly Pro Val Ala Pro Leu Pro Thr Pro Pro As - #n Phe Pro Asn Asp Ile     #             25     - GCG CTG TTC CAG CAG GCG TAC CAG AAC TGG TC - #C AAG GAG ATC ATG CTG      147     Ala Leu Phe Gln Gln Ala Tyr Gln Asn Trp Se - #r Lys Glu Ile Met Leu     #         40     - GAC GCC ACT TGG GTC TGC TCG CCC AAG ACG CC - #G CAG GAT GTC GTT CGC      195     Asp Ala Thr Trp Val Cys Ser Pro Lys Thr Pr - #o Gln Asp Val Val Arg     #     55     - CTT GCC AAC TGG GCG CAC GAG CAC GAC TAC AA - #G ATC CGC CCG CGC GGC      243     Leu Ala Asn Trp Ala His Glu His Asp Tyr Ly - #s Ile Arg Pro Arg Gly     # 75     - GCG ATG CAC GGC TGG ACC CCG CTC ACC GTG GA - #G AAG GGG GCC AAC GTC      291     Ala Met His Gly Trp Thr Pro Leu Thr Val Gl - #u Lys Gly Ala Asn Val     #                 90     - GAG AAG GTG ATC CTC GCC GAC ACG ATG ACG CA - #T CTG AAC GGC ATC ACG      339     Glu Lys Val Ile Leu Ala Asp Thr Met Thr Hi - #s Leu Asn Gly Ile Thr     #            105     - GTG AAC ACG GGC GGC CCC GTG GCT ACC GTC AC - #C GCC GGT GCC GGC GCC      387     Val Asn Thr Gly Gly Pro Val Ala Thr Val Th - #r Ala Gly Ala Gly Ala     #       120     - AGC ATC GAG GCG ATC GTC ACC GAA CTG CAG AA - #G CAC GAC CTC GGC TGG      435     Ser Ile Glu Ala Ile Val Thr Glu Leu Gln Ly - #s His Asp Leu Gly Trp     #   135     - GCC AAC CTG CCC GCT CCG GGT GTG CTG TCG AT - #C GGT GGC GCC CTT GCG      483     Ala Asn Leu Pro Ala Pro Gly Val Leu Ser Il - #e Gly Gly Ala Leu Ala     140                 1 - #45                 1 - #50                 1 -     #55     - GTC AAC GCG CAC GGT GCG GCG CTG CCG GCC GT - #C GGC CAG ACC ACG CTG      531     Val Asn Ala 
His Gly Ala Ala Leu Pro Ala Va - #l Gly Gln Thr Thr Leu     #               170     - CCC GGT CAC ACC TAC GGT TCG CTG AGC AAC CT - #G GTC ACC GAG CTG ACC      579     Pro Gly His Thr Tyr Gly Ser Leu Ser Asn Le - #u Val Thr Glu Leu Thr     #           185     - GCG GTC GTC TGG AAC GGC ACC ACC TAC GCA CT - #C GAG ACG TAC CAG CGC      627     Ala Val Val Trp Asn Gly Thr Thr Tyr Ala Le - #u Glu Thr Tyr Gln Arg     #       200     - AAC GAT CCT CGG ATC ACC CCA CTG CTC ACC AA - #C CTC GGG CGC TGC TTC      675     Asn Asp Pro Arg Ile Thr Pro Leu Leu Thr As - #n Leu Gly Arg Cys Phe     #   215     - CTG ACC TCG GTG ACG ATG CAG GCC GGC CCC AA - #C TTC CGT CAG CGG TGC      723     Leu Thr Ser Val Thr Met Gln Ala Gly Pro As - #n Phe Arg Gln Arg Cys     220                 2 - #25                 2 - #30                 2 -     #35     - CAG AGC TAC ACC GAC ATC CCG TGG CGG GAA CT - #G TTC GCG CCG AAG GGC      771     Gln Ser Tyr Thr Asp Ile Pro Trp Arg Glu Le - #u Phe Ala Pro Lys Gly     #               250     - GCC GAC GGC CGC ACG TTC GAG AAG TTC GTC GC - #G GAA TCG GGC GGC GCC      819     Ala Asp Gly Arg Thr Phe Glu Lys Phe Val Al - #a Glu Ser Gly Gly Ala     #           265     - GAG GCG ATC TGG TAC CCG TTC ACC GAG AAG CC - #G TGG ATG AAG GTG TGG      867     Glu Ala Ile Trp Tyr Pro Phe Thr Glu Lys Pr - #o Trp Met Lys Val Trp     #       280     - ACG GTC TCG CCG ACC AAG CCG GAC TCG TCG AA - #C GAG GTC GGA AGC CTC      915     Thr Val Ser Pro Thr Lys Pro Asp Ser Ser As - #n Glu Val Gly Ser Leu     #   295     - GGC TCG GCG GGC TCC CTC GTC GGC AAG CCT CC - #G CAG GCG CGT GAG GTC      963     Gly Ser Ala Gly Ser Leu Val Gly Lys Pro Pr - #o Gln Ala Arg Glu Val     300                 3 - #05                 3 - #10                 3 -     #15     - TCC GGC CCG TAC AAC TAC ATC TTC TCC GAC AA - #C CTG CCG GAG CCC ATC     1011     Ser Gly Pro Tyr Asn Tyr Ile Phe Ser Asp As - #n Leu Pro Glu Pro Ile     #               330     - ACC GAC ATG ATC GGC GCC ATC AAC GCC GGA AA - #C CCC 
GGA ATC GCA CCG     1059     Thr Asp Met Ile Gly Ala Ile Asn Ala Gly As - #n Pro Gly Ile Ala Pro     #           345     - CTG TTC GGC CCG GCG ATG TAC GAG ATC ACC AA - #G CTC GGG CTG GCC GCG     1107     Leu Phe Gly Pro Ala Met Tyr Glu Ile Thr Ly - #s Leu Gly Leu Ala Ala     #       360     - ACG AAT GCC AAC GAC ATC TGG GGC TGG TCG AA - #G GAC GTC CAG TTC TAC     1155     Thr Asn Ala Asn Asp Ile Trp Gly Trp Ser Ly - #s Asp Val Gln Phe Tyr     #   375     - ATC AAG GCC ACG ACG TTG CGA CTC ACC GAG GG - #C GGC GGC GCC GTC GTC     1203     Ile Lys Ala Thr Thr Leu Arg Leu Thr Glu Gl - #y Gly Gly Ala Val Val     380                 3 - #85                 3 - #90                 3 -     #95     - ACG AGC CGC GCC AAC ATC GCG ACC GTG ATC AA - #C GAC TTC ACC GAG TGG     1251     Thr Ser Arg Ala Asn Ile Ala Thr Val Ile As - #n Asp Phe Thr Glu Trp     #               410     - TTC CAC GAG CGC ATC GAG TTC TAC CGC GCG AA - #G GGC GAG TTC CCG CTC     1299     Phe His Glu Arg Ile Glu Phe Tyr Arg Ala Ly - #s Gly Glu Phe Pro Leu     #           425     - AAC GGT CCG GTC GAG ATC CGC TGC TGC GGG CT - #C GAT CAG GCA GCC GAC     1347     Asn Gly Pro Val Glu Ile Arg Cys Cys Gly Le - #u Asp Gln Ala Ala Asp     #       440     - GTC AAG GTG CCG TCG GTG GGC CCG CCG ACC AT - #C TCG GCG ACC CGT CCG     1395     Val Lys Val Pro Ser Val Gly Pro Pro Thr Il - #e Ser Ala Thr Arg Pro     #   455     - CGT CCG GAT CAT CCG GAC TGG GAC GTC GCG AT - #C TGG CTG AAC GTT CTC     1443     Arg Pro Asp His Pro Asp Trp Asp Val Ala Il - #e Trp Leu Asn Val Leu     460                 4 - #65                 4 - #70                 4 -     #75     - GGT GTT CCG GGC ACC CCC GGC ATG TTC GAG TT - #C TAC CGC GAG ATG GAG     1491     Gly Val Pro Gly Thr Pro Gly Met Phe Glu Ph - #e Tyr Arg Glu Met Glu     #               490     - CAG TGG ATG CGG AGC CAC TAC AAC AAC GAC GA - #C GCC ACC TTC CGG CCC     1539     Gln Trp Met Arg Ser His Tyr Asn Asn Asp As - #p Ala Thr Phe Arg Pro     #           505     - GAG TGG TCG 
AAG GGG TGG GCG TTC GGT CCC GA - #C CCG TAC ACC GAC AAC     1587     Glu Trp Ser Lys Gly Trp Ala Phe Gly Pro As - #p Pro Tyr Thr Asp Asn     #       520     - GAC ATC GTC ACG AAC AAG ATG CGC GCC ACC TA - #C ATC GAA GGT GTC CCG     1635     Asp Ile Val Thr Asn Lys Met Arg Ala Thr Ty - #r Ile Glu Gly Val Pro     #   535     - ACG ACC GAG AAC TGG GAC ACC GCG CGC GCT CG - #G TAC AAC CAG ATC GAC     1683     Thr Thr Glu Asn Trp Asp Thr Ala Arg Ala Ar - #g Tyr Asn Gln Ile Asp     540                 5 - #45                 5 - #50                 5 -     #55     - CCG CAT CGC GTG TTC ACC AAC GGA TTC ATG GA - #C AAG CTG CTT CCG     1728     Pro His Arg Val Phe Thr Asn Gly Phe Met As - #p Lys Leu Leu Pro     #               570     - (2) INFORMATION FOR SEQ ID NO: 25:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 570 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #25:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Ser Asn His His His Gly His Ala Ser Th - #r Gly Pro Val Ala Pro     #                 15     - Leu Pro Thr Pro Pro Asn Phe Pro Asn Asp Il - #e Ala Leu Phe Gln Gln     #             30     - Ala Tyr Gln Asn Trp Ser Lys Glu Ile Met Le - #u Asp Ala Thr Trp Val     #         45     - Cys Ser Pro Lys Thr Pro Gln Asp Val Val Ar - #g Leu Ala Asn Trp Ala     #     60     - His Glu His Asp Tyr Lys Ile Arg Pro Arg Gl - #y Ala Met His Gly Trp     # 80     - Thr Pro Leu Thr Val Glu Lys Gly Ala Asn Va - #l Glu Lys Val Ile Leu     #                 95     - Ala Asp Thr Met Thr His Leu Asn Gly Ile Th - #r Val Asn Thr Gly Gly     #           110     - Pro Val Ala Thr Val Thr Ala Gly Ala Gly Al - #a Ser Ile Glu Ala Ile     #       125     - Val Thr Glu Leu Gln Lys His Asp Leu Gly Tr - #p Ala Asn Leu Pro Ala     #   140     - Pro Gly Val Leu Ser Ile Gly Gly Ala Leu Al - #a Val Asn Ala His Gly     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Ala 
Leu Pro Ala Val Gly Gln Thr Thr Le - #u Pro Gly His Thr Tyr     #               175     - Gly Ser Leu Ser Asn Leu Val Thr Glu Leu Th - #r Ala Val Val Trp Asn     #           190     - Gly Thr Thr Tyr Ala Leu Glu Thr Tyr Gln Ar - #g Asn Asp Pro Arg Ile     #       205     - Thr Pro Leu Leu Thr Asn Leu Gly Arg Cys Ph - #e Leu Thr Ser Val Thr     #   220     - Met Gln Ala Gly Pro Asn Phe Arg Gln Arg Cy - #s Gln Ser Tyr Thr Asp     225                 2 - #30                 2 - #35                 2 -     #40     - Ile Pro Trp Arg Glu Leu Phe Ala Pro Lys Gl - #y Ala Asp Gly Arg Thr     #               255     - Phe Glu Lys Phe Val Ala Glu Ser Gly Gly Al - #a Glu Ala Ile Trp Tyr     #           270     - Pro Phe Thr Glu Lys Pro Trp Met Lys Val Tr - #p Thr Val Ser Pro Thr     #       285     - Lys Pro Asp Ser Ser Asn Glu Val Gly Ser Le - #u Gly Ser Ala Gly Ser     #   300     - Leu Val Gly Lys Pro Pro Gln Ala Arg Glu Va - #l Ser Gly Pro Tyr Asn     305                 3 - #10                 3 - #15                 3 -     #20     - Tyr Ile Phe Ser Asp Asn Leu Pro Glu Pro Il - #e Thr Asp Met Ile Gly     #               335     - Ala Ile Asn Ala Gly Asn Pro Gly Ile Ala Pr - #o Leu Phe Gly Pro Ala     #           350     - Met Tyr Glu Ile Thr Lys Leu Gly Leu Ala Al - #a Thr Asn Ala Asn Asp     #       365     - Ile Trp Gly Trp Ser Lys Asp Val Gln Phe Ty - #r Ile Lys Ala Thr Thr     #   380     - Leu Arg Leu Thr Glu Gly Gly Gly Ala Val Va - #l Thr Ser Arg Ala Asn     385                 3 - #90                 3 - #95                 4 -     #00     - Ile Ala Thr Val Ile Asn Asp Phe Thr Glu Tr - #p Phe His Glu Arg Ile     #               415     - Glu Phe Tyr Arg Ala Lys Gly Glu Phe Pro Le - #u Asn Gly Pro Val Glu     #           430     - Ile Arg Cys Cys Gly Leu Asp Gln Ala Ala As - #p Val Lys Val Pro Ser     #       445     - Val Gly Pro Pro Thr Ile Ser Ala Thr Arg Pr - #o Arg Pro Asp His Pro     #   460     - Asp Trp Asp Val Ala Ile Trp Leu Asn Val Le - #u Gly Val Pro 
Gly Thr     465                 4 - #70                 4 - #75                 4 -     #80     - Pro Gly Met Phe Glu Phe Tyr Arg Glu Met Gl - #u Gln Trp Met Arg Ser     #               495     - His Tyr Asn Asn Asp Asp Ala Thr Phe Arg Pr - #o Glu Trp Ser Lys Gly     #           510     - Trp Ala Phe Gly Pro Asp Pro Tyr Thr Asp As - #n Asp Ile Val Thr Asn     #       525     - Lys Met Arg Ala Thr Tyr Ile Glu Gly Val Pr - #o Thr Thr Glu Asn Trp     #   540     - Asp Thr Ala Arg Ala Arg Tyr Asn Gln Ile As - #p Pro His Arg Val Phe     545                 5 - #50                 5 - #55                 5 -     #60     - Thr Asn Gly Phe Met Asp Lys Leu Leu Pro     #               570     - (2) INFORMATION FOR SEQ ID NO: 26:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1741 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 20..1741     #26:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - GAATTTAAGG GGAACATCG ATG AGT AAT ACG CGT AAA CGC - # AAG CGC CGT ACG       52     Met Ser Asn Thr Arg Lys Arg Lys Arg Arg Th - #r     #                10     - CAT GCC TCG ACC GGG CCG GTC GCG CCG CTT CC - #G ACG CCG CCG AAC TTC      100     His Ala Ser Thr Gly Pro Val Ala Pro Leu Pr - #o Thr Pro Pro Asn Phe     #             25     - CCG AAC GAC ATC GCG CTG TTC CAG CAG GCG TA - #C CAG AAC TGG TCC AAG      148     Pro Asn Asp Ile Ala Leu Phe Gln Gln Ala Ty - #r Gln Asn Trp Ser Lys     #         40     - GAG ATC ATG CTG GAC GCC ACT TGG GTC TGC TC - #G CCC AAG ACG CCG CAG      196     Glu Ile Met Leu Asp Ala Thr Trp Val Cys Se - #r Pro Lys Thr Pro Gln     #     55     - GAT GTC GTT CGC CTT GCC AAC TGG GCG CAC GA - #G CAC GAC TAC AAG ATC      244     Asp Val Val Arg Leu Ala Asn Trp Ala His Gl - #u His Asp Tyr Lys Ile     # 75     - CGC CCG CGC GGC GCG ATG CAC GGC TGG ACC CC - #G CTC ACC GTG GAG AAG      292     Arg Pro Arg 
Gly Ala Met His Gly Trp Thr Pr - #o Leu Thr Val Glu Lys     #                 90     - GGG GCC AAC GTC GAG AAG GTG ATC CTC GCC GA - #C ACG ATG ACG CAT CTG      340     Gly Ala Asn Val Glu Lys Val Ile Leu Ala As - #p Thr Met Thr His Leu     #            105     - AAC GGC ATC ACG GTG AAC ACG GGC GGC CCC GT - #G GCT ACC GTC ACC GCC      388     Asn Gly Ile Thr Val Asn Thr Gly Gly Pro Va - #l Ala Thr Val Thr Ala     #       120     - GGT GCC GGC GCC AGC ATC GAG GCG ATC GTC AC - #C GAA CTG CAG AAG CAC      436     Gly Ala Gly Ala Ser Ile Glu Ala Ile Val Th - #r Glu Leu Gln Lys His     #   135     - GAC CTC GGC TGG GCC AAC CTG CCC GCT CCG GG - #T GTG CTG TCG ATC GGT      484     Asp Leu Gly Trp Ala Asn Leu Pro Ala Pro Gl - #y Val Leu Ser Ile Gly     140                 1 - #45                 1 - #50                 1 -     #55     - GGC GCC CTT GCG GTC AAC GCG CAC GGT GCG GC - #G CTG CCG GCC GTC GGC      532     Gly Ala Leu Ala Val Asn Ala His Gly Ala Al - #a Leu Pro Ala Val Gly     #               170     - CAG ACC ACG CTG CCC GGT CAC ACC TAC GGT TC - #G CTG AGC AAC CTG GTC      580     Gln Thr Thr Leu Pro Gly His Thr Tyr Gly Se - #r Leu Ser Asn Leu Val     #           185     - ACC GAG CTG ACC GCG GTC GTC TGG AAC GGC AC - #C ACC TAC GCA CTC GAG      628     Thr Glu Leu Thr Ala Val Val Trp Asn Gly Th - #r Thr Tyr Ala Leu Glu     #       200     - ACG TAC CAG CGC AAC GAT CCT CGG ATC ACC CC - #A CTG CTC ACC AAC CTC      676     Thr Tyr Gln Arg Asn Asp Pro Arg Ile Thr Pr - #o Leu Leu Thr Asn Leu     #   215     - GGG CGC TGC TTC CTG ACC TCG GTG ACG ATG CA - #G GCC GGC CCC AAC TTC      724     Gly Arg Cys Phe Leu Thr Ser Val Thr Met Gl - #n Ala Gly Pro Asn Phe     220                 2 - #25                 2 - #30                 2 -     #35     - CGT CAG CGG TGC CAG AGC TAC ACC GAC ATC CC - #G TGG CGG GAA CTG TTC      772     Arg Gln Arg Cys Gln Ser Tyr Thr Asp Ile Pr - #o Trp Arg Glu Leu Phe     #               250     - GCG CCG AAG GGC GCC GAC GGC CGC ACG TTC GA - #G 
AAG TTC GTC GCG GAA      820     Ala Pro Lys Gly Ala Asp Gly Arg Thr Phe Gl - #u Lys Phe Val Ala Glu     #           265     - TCG GGC GGC GCC GAG GCG ATC TGG TAC CCG TT - #C ACC GAG AAG CCG TGG      868     Ser Gly Gly Ala Glu Ala Ile Trp Tyr Pro Ph - #e Thr Glu Lys Pro Trp     #       280     - ATG AAG GTG TGG ACG GTC TCG CCG ACC AAG CC - #G GAC TCG TCG AAC GAG      916     Met Lys Val Trp Thr Val Ser Pro Thr Lys Pr - #o Asp Ser Ser Asn Glu     #   295     - GTC GGA AGC CTC GGC TCG GCG GGC TCC CTC GT - #C GGC AAG CCT CCG CAG      964     Val Gly Ser Leu Gly Ser Ala Gly Ser Leu Va - #l Gly Lys Pro Pro Gln     300                 3 - #05                 3 - #10                 3 -     #15     - GCG CGT GAG GTC TCC GGC CCG TAC AAC TAC AT - #C TTC TCC GAC AAC CTG     1012     Ala Arg Glu Val Ser Gly Pro Tyr Asn Tyr Il - #e Phe Ser Asp Asn Leu     #               330     - CCG GAG CCC ATC ACC GAC ATG ATC GGC GCC AT - #C AAC GCC GGA AAC CCC     1060     Pro Glu Pro Ile Thr Asp Met Ile Gly Ala Il - #e Asn Ala Gly Asn Pro     #           345     - GGA ATC GCA CCG CTG TTC GGC CCG GCG ATG TA - #C GAG ATC ACC AAG CTC     1108     Gly Ile Ala Pro Leu Phe Gly Pro Ala Met Ty - #r Glu Ile Thr Lys Leu     #       360     - GGG CTG GCC GCG ACG AAT GCC AAC GAC ATC TG - #G GGC TGG TCG AAG GAC     1156     Gly Leu Ala Ala Thr Asn Ala Asn Asp Ile Tr - #p Gly Trp Ser Lys Asp     #   375     - GTC CAG TTC TAC ATC AAG GCC ACG ACG TTG CG - #A CTC ACC GAG GGC GGC     1204     Val Gln Phe Tyr Ile Lys Ala Thr Thr Leu Ar - #g Leu Thr Glu Gly Gly     380                 3 - #85                 3 - #90                 3 -     #95     - GGC GCC GTC GTC ACG AGC CGC GCC AAC ATC GC - #G ACC GTG ATC AAC GAC     1252     Gly Ala Val Val Thr Ser Arg Ala Asn Ile Al - #a Thr Val Ile Asn Asp     #               410     - TTC ACC GAG TGG TTC CAC GAG CGC ATC GAG TT - #C TAC CGC GCG AAG GGC     1300     Phe Thr Glu Trp Phe His Glu Arg Ile Glu Ph - #e Tyr Arg Ala Lys Gly     #           425     - GAG TTC 
CCG CTC AAC GGT CCG GTC GAG ATC CG - #C TGC TGC GGG CTC GAT     1348     Glu Phe Pro Leu Asn Gly Pro Val Glu Ile Ar - #g Cys Cys Gly Leu Asp     #       440     - CAG GCA GCC GAC GTC AAG GTG CCG TCG GTG GG - #C CCG CCG ACC ATC TCG     1396     Gln Ala Ala Asp Val Lys Val Pro Ser Val Gl - #y Pro Pro Thr Ile Ser     #   455     - GCG ACC CGT CCG CGT CCG GAT CAT CCG GAC TG - #G GAC GTC GCG ATC TGG     1444     Ala Thr Arg Pro Arg Pro Asp His Pro Asp Tr - #p Asp Val Ala Ile Trp     460                 4 - #65                 4 - #70                 4 -     #75     - CTG AAC GTT CTC GGT GTT CCG GGC ACC CCC GG - #C ATG TTC GAG TTC TAC     1492     Leu Asn Val Leu Gly Val Pro Gly Thr Pro Gl - #y Met Phe Glu Phe Tyr     #               490     - CGC GAG ATG GAG CAG TGG ATG CGG AGC CAC TA - #C AAC AAC GAC GAC GCC     1540     Arg Glu Met Glu Gln Trp Met Arg Ser His Ty - #r Asn Asn Asp Asp Ala     #           505     - ACC TTC CGG CCC GAG TGG TCG AAG GGG TGG GC - #G TTC GGT CCC GAC CCG     1588     Thr Phe Arg Pro Glu Trp Ser Lys Gly Trp Al - #a Phe Gly Pro Asp Pro     #       520     - TAC ACC GAC AAC GAC ATC GTC ACG AAC AAG AT - #G CGC GCC ACC TAC ATC     1636     Tyr Thr Asp Asn Asp Ile Val Thr Asn Lys Me - #t Arg Ala Thr Tyr Ile     #   535     - GAA GGT GTC CCG ACG ACC GAG AAC TGG GAC AC - #C GCG CGC GCT CGG TAC     1684     Glu Gly Val Pro Thr Thr Glu Asn Trp Asp Th - #r Ala Arg Ala Arg Tyr     540                 5 - #45                 5 - #50                 5 -     #55     - AAC CAG ATC GAC CCG CAT CGC GTG TTC ACC AA - #C GGA TTC ATG GAC AAG     1732     Asn Gln Ile Asp Pro His Arg Val Phe Thr As - #n Gly Phe Met Asp Lys     #               570     #       1741     Leu Leu Pro     - (2) INFORMATION FOR SEQ ID NO: 27:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 574 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #27:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Ser Asn 
Thr Arg Lys Arg Lys Arg Arg Th - #r His Ala Ser Thr Gly     #                 15     - Pro Val Ala Pro Leu Pro Thr Pro Pro Asn Ph - #e Pro Asn Asp Ile Ala     #             30     - Leu Phe Gln Gln Ala Tyr Gln Asn Trp Ser Ly - #s Glu Ile Met Leu Asp     #         45     - Ala Thr Trp Val Cys Ser Pro Lys Thr Pro Gl - #n Asp Val Val Arg Leu     #     60     - Ala Asn Trp Ala His Glu His Asp Tyr Lys Il - #e Arg Pro Arg Gly Ala     # 80     - Met His Gly Trp Thr Pro Leu Thr Val Glu Ly - #s Gly Ala Asn Val Glu     #                 95     - Lys Val Ile Leu Ala Asp Thr Met Thr His Le - #u Asn Gly Ile Thr Val     #           110     - Asn Thr Gly Gly Pro Val Ala Thr Val Thr Al - #a Gly Ala Gly Ala Ser     #       125     - Ile Glu Ala Ile Val Thr Glu Leu Gln Lys Hi - #s Asp Leu Gly Trp Ala     #   140     - Asn Leu Pro Ala Pro Gly Val Leu Ser Ile Gl - #y Gly Ala Leu Ala Val     145                 1 - #50                 1 - #55                 1 -     #60     - Asn Ala His Gly Ala Ala Leu Pro Ala Val Gl - #y Gln Thr Thr Leu Pro     #               175     - Gly His Thr Tyr Gly Ser Leu Ser Asn Leu Va - #l Thr Glu Leu Thr Ala     #           190     - Val Val Trp Asn Gly Thr Thr Tyr Ala Leu Gl - #u Thr Tyr Gln Arg Asn     #       205     - Asp Pro Arg Ile Thr Pro Leu Leu Thr Asn Le - #u Gly Arg Cys Phe Leu     #   220     - Thr Ser Val Thr Met Gln Ala Gly Pro Asn Ph - #e Arg Gln Arg Cys Gln     225                 2 - #30                 2 - #35                 2 -     #40     - Ser Tyr Thr Asp Ile Pro Trp Arg Glu Leu Ph - #e Ala Pro Lys Gly Ala     #               255     - Asp Gly Arg Thr Phe Glu Lys Phe Val Ala Gl - #u Ser Gly Gly Ala Glu     #           270     - Ala Ile Trp Tyr Pro Phe Thr Glu Lys Pro Tr - #p Met Lys Val Trp Thr     #       285     - Val Ser Pro Thr Lys Pro Asp Ser Ser Asn Gl - #u Val Gly Ser Leu Gly     #   300     - Ser Ala Gly Ser Leu Val Gly Lys Pro Pro Gl - #n Ala Arg Glu Val Ser     305                 3 - #10                 3 - #15             
    3 -     #20     - Gly Pro Tyr Asn Tyr Ile Phe Ser Asp Asn Le - #u Pro Glu Pro Ile Thr     #               335     - Asp Met Ile Gly Ala Ile Asn Ala Gly Asn Pr - #o Gly Ile Ala Pro Leu     #           350     - Phe Gly Pro Ala Met Tyr Glu Ile Thr Lys Le - #u Gly Leu Ala Ala Thr     #       365     - Asn Ala Asn Asp Ile Trp Gly Trp Ser Lys As - #p Val Gln Phe Tyr Ile     #   380     - Lys Ala Thr Thr Leu Arg Leu Thr Glu Gly Gl - #y Gly Ala Val Val Thr     385                 3 - #90                 3 - #95                 4 -     #00     - Ser Arg Ala Asn Ile Ala Thr Val Ile Asn As - #p Phe Thr Glu Trp Phe     #               415     - His Glu Arg Ile Glu Phe Tyr Arg Ala Lys Gl - #y Glu Phe Pro Leu Asn     #           430     - Gly Pro Val Glu Ile Arg Cys Cys Gly Leu As - #p Gln Ala Ala Asp Val     #       445     - Lys Val Pro Ser Val Gly Pro Pro Thr Ile Se - #r Ala Thr Arg Pro Arg     #   460     - Pro Asp His Pro Asp Trp Asp Val Ala Ile Tr - #p Leu Asn Val Leu Gly     465                 4 - #70                 4 - #75                 4 -     #80     - Val Pro Gly Thr Pro Gly Met Phe Glu Phe Ty - #r Arg Glu Met Glu Gln     #               495     - Trp Met Arg Ser His Tyr Asn Asn Asp Asp Al - #a Thr Phe Arg Pro Glu     #           510     - Trp Ser Lys Gly Trp Ala Phe Gly Pro Asp Pr - #o Tyr Thr Asp Asn Asp     #       525     - Ile Val Thr Asn Lys Met Arg Ala Thr Tyr Il - #e Glu Gly Val Pro Thr     #   540     - Thr Glu Asn Trp Asp Thr Ala Arg Ala Arg Ty - #r Asn Gln Ile Asp Pro     545                 5 - #50                 5 - #55                 5 -     #60     - His Arg Val Phe Thr Asn Gly Phe Met Asp Ly - #s Leu Leu Pro     #               570     - (2) INFORMATION FOR SEQ ID NO: 28:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1731 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 
25..1731     #28:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - GAATTCACAC AGGAAACAGA ATTC ATG GTT ATG CAC CAT G - #GG CAT GCC TCG       51     Met Val Met His His Gly His Ala Ser     #  1               5     - ACC GGG CCG GTC GCG CCG CTT CCG ACG CCG CC - #G AAC TTC CCG AAC GAC       99     Thr Gly Pro Val Ala Pro Leu Pro Thr Pro Pr - #o Asn Phe Pro Asn Asp     # 25     - ATC GCG CTG TTC CAG CAG GCG TAC CAG AAC TG - #G TCC AAG GAG ATC ATG      147     Ile Ala Leu Phe Gln Gln Ala Tyr Gln Asn Tr - #p Ser Lys Glu Ile Met     #                 40     - CTG GAC GCC ACT TGG GTC TGC TCG CCC AAG AC - #G CCG CAG GAT GTC GTT      195     Leu Asp Ala Thr Trp Val Cys Ser Pro Lys Th - #r Pro Gln Asp Val Val     #             55     - CGC CTT GCC AAC TGG GCG CAC GAG CAC GAC TA - #C AAG ATC CGC CCG CGC      243     Arg Leu Ala Asn Trp Ala His Glu His Asp Ty - #r Lys Ile Arg Pro Arg     #         70     - GGC GCG ATG CAC GGC TGG ACC CCG CTC ACC GT - #G GAG AAG GGG GCC AAC      291     Gly Ala Met His Gly Trp Thr Pro Leu Thr Va - #l Glu Lys Gly Ala Asn     #     85     - GTC GAG AAG GTG ATC CTC GCC GAC ACG ATG AC - #G CAT CTG AAC GGC ATC      339     Val Glu Lys Val Ile Leu Ala Asp Thr Met Th - #r His Leu Asn Gly Ile     #105     - ACG GTG AAC ACG GGC GGC CCC GTG GCT ACC GT - #C ACC GCC GGT GCC GGC      387     Thr Val Asn Thr Gly Gly Pro Val Ala Thr Va - #l Thr Ala Gly Ala Gly     #               120     - GCC AGC ATC GAG GCG ATC GTC ACC GAA CTG CA - #G AAG CAC GAC CTC GGC      435     Ala Ser Ile Glu Ala Ile Val Thr Glu Leu Gl - #n Lys His Asp Leu Gly     #           135     - TGG GCC AAC CTG CCC GCT CCG GGT GTG CTG TC - #G ATC GGT GGC GCC CTT      483     Trp Ala Asn Leu Pro Ala Pro Gly Val Leu Se - #r Ile Gly Gly Ala Leu     #       150     - GCG GTC AAC GCG CAC GGT GCG GCG CTG CCG GC - #C GTC GGC CAG ACC ACG      531     Ala Val Asn Ala His Gly Ala Ala Leu Pro Al - #a Val Gly Gln Thr Thr     #   165     - CTG CCC GGT CAC ACC TAC GGT TCG CTG AGC AA - #C CTG GTC ACC GAG CTG      579     Leu Pro 
Gly His Thr Tyr Gly Ser Leu Ser As - #n Leu Val Thr Glu Leu     170                 1 - #75                 1 - #80                 1 -     #85     - ACC GCG GTC GTC TGG AAC GGC ACC ACC TAC GC - #A CTC GAG ACG TAC CAG      627     Thr Ala Val Val Trp Asn Gly Thr Thr Tyr Al - #a Leu Glu Thr Tyr Gln     #               200     - CGC AAC GAT CCT CGG ATC ACC CCA CTG CTC AC - #C AAC CTC GGG CGC TGC      675     Arg Asn Asp Pro Arg Ile Thr Pro Leu Leu Th - #r Asn Leu Gly Arg Cys     #           215     - TTC CTG ACC TCG GTG ACG ATG CAG GCC GGC CC - #C AAC TTC CGT CAG CGG      723     Phe Leu Thr Ser Val Thr Met Gln Ala Gly Pr - #o Asn Phe Arg Gln Arg     #       230     - TGC CAG AGC TAC ACC GAC ATC CCG TGG CGG GA - #A CTG TTC GCG CCG AAG      771     Cys Gln Ser Tyr Thr Asp Ile Pro Trp Arg Gl - #u Leu Phe Ala Pro Lys     #   245     - GGC GCC GAC GGC CGC ACG TTC GAG AAG TTC GT - #C GCG GAA TCG GGC GGC      819     Gly Ala Asp Gly Arg Thr Phe Glu Lys Phe Va - #l Ala Glu Ser Gly Gly     250                 2 - #55                 2 - #60                 2 -     #65     - GCC GAG GCG ATC TGG TAC CCG TTC ACC GAG AA - #G CCG TGG ATG AAG GTG      867     Ala Glu Ala Ile Trp Tyr Pro Phe Thr Glu Ly - #s Pro Trp Met Lys Val     #               280     - TGG ACG GTC TCG CCG ACC AAG CCG GAC TCG TC - #G AAC GAG GTC GGA AGC      915     Trp Thr Val Ser Pro Thr Lys Pro Asp Ser Se - #r Asn Glu Val Gly Ser     #           295     - CTC GGC TCG GCG GGC TCC CTC GTC GGC AAG CC - #T CCG CAG GCG CGT GAG      963     Leu Gly Ser Ala Gly Ser Leu Val Gly Lys Pr - #o Pro Gln Ala Arg Glu     #       310     - GTC TCC GGC CCG TAC AAC TAC ATC TTC TCC GA - #C AAC CTG CCG GAG CCC     1011     Val Ser Gly Pro Tyr Asn Tyr Ile Phe Ser As - #p Asn Leu Pro Glu Pro     #   325     - ATC ACC GAC ATG ATC GGC GCC ATC AAC GCC GG - #A AAC CCC GGA ATC GCA     1059     Ile Thr Asp Met Ile Gly Ala Ile Asn Ala Gl - #y Asn Pro Gly Ile Ala     330                 3 - #35                 3 - #40                 3 -   
  #45     - CCG CTG TTC GGC CCG GCG ATG TAC GAG ATC AC - #C AAG CTC GGG CTG GCC     1107     Pro Leu Phe Gly Pro Ala Met Tyr Glu Ile Th - #r Lys Leu Gly Leu Ala     #               360     - GCG ACG AAT GCC AAC GAC ATC TGG GGC TGG TC - #G AAG GAC GTC CAG TTC     1155     Ala Thr Asn Ala Asn Asp Ile Trp Gly Trp Se - #r Lys Asp Val Gln Phe     #           375     - TAC ATC AAG GCC ACG ACG TTG CGA CTC ACC GA - #G GGC GGC GGC GCC GTC     1203     Tyr Ile Lys Ala Thr Thr Leu Arg Leu Thr Gl - #u Gly Gly Gly Ala Val     #       390     - GTC ACG AGC CGC GCC AAC ATC GCG ACC GTG AT - #C AAC GAC TTC ACC GAG     1251     Val Thr Ser Arg Ala Asn Ile Ala Thr Val Il - #e Asn Asp Phe Thr Glu     #   405     - TGG TTC CAC GAG CGC ATC GAG TTC TAC CGC GC - #G AAG GGC GAG TTC CCG     1299     Trp Phe His Glu Arg Ile Glu Phe Tyr Arg Al - #a Lys Gly Glu Phe Pro     410                 4 - #15                 4 - #20                 4 -     #25     - CTC AAC GGT CCG GTC GAG ATC CGC TGC TGC GG - #G CTC GAT CAG GCA GCC     1347     Leu Asn Gly Pro Val Glu Ile Arg Cys Cys Gl - #y Leu Asp Gln Ala Ala     #               440     - GAC GTC AAG GTG CCG TCG GTG GGC CCG CCG AC - #C ATC TCG GCG ACC CGT     1395     Asp Val Lys Val Pro Ser Val Gly Pro Pro Th - #r Ile Ser Ala Thr Arg     #           455     - CCG CGT CCG GAT CAT CCG GAC TGG GAC GTC GC - #G ATC TGG CTG AAC GTT     1443     Pro Arg Pro Asp His Pro Asp Trp Asp Val Al - #a Ile Trp Leu Asn Val     #       470     - CTC GGT GTT CCG GGC ACC CCC GGC ATG TTC GA - #G TTC TAC CGC GAG ATG     1491     Leu Gly Val Pro Gly Thr Pro Gly Met Phe Gl - #u Phe Tyr Arg Glu Met     #   485     - GAG CAG TGG ATG CGG AGC CAC TAC AAC AAC GA - #C GAC GCC ACC TTC CGG     1539     Glu Gln Trp Met Arg Ser His Tyr Asn Asn As - #p Asp Ala Thr Phe Arg     490                 4 - #95                 5 - #00                 5 -     #05     - CCC GAG TGG TCG AAG GGG TGG GCG TTC GGT CC - #C GAC CCG TAC ACC GAC     1587     Pro Glu Trp Ser Lys Gly Trp Ala Phe Gly Pr - 
#o Asp Pro Tyr Thr Asp     #               520     - AAC GAC ATC GTC ACG AAC AAG ATG CGC GCC AC - #C TAC ATC GAA GGT GTC     1635     Asn Asp Ile Val Thr Asn Lys Met Arg Ala Th - #r Tyr Ile Glu Gly Val     #           535     - CCG ACG ACC GAG AAC TGG GAC ACC GCG CGC GC - #T CGG TAC AAC CAG ATC     1683     Pro Thr Thr Glu Asn Trp Asp Thr Ala Arg Al - #a Arg Tyr Asn Gln Ile     #       550     - GAC CCG CAT CGC GTG TTC ACC AAC GGA TTC AT - #G GAC AAG CTG CTT CCG     1731     Asp Pro His Arg Val Phe Thr Asn Gly Phe Me - #t Asp Lys Leu Leu Pro     #   565     - (2) INFORMATION FOR SEQ ID NO: 29:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 569 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: Protein     #29:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     - Met Val Met His His Gly His Ala Ser Thr Gl - #y Pro Val Ala Pro Leu     #                 15     - Pro Thr Pro Pro Asn Phe Pro Asn Asp Ile Al - #a Leu Phe Gln Gln Ala     #             30     - Tyr Gln Asn Trp Ser Lys Glu Ile Met Leu As - #p Ala Thr Trp Val Cys     #         45     - Ser Pro Lys Thr Pro Gln Asp Val Val Arg Le - #u Ala Asn Trp Ala His     #     60     - Glu His Asp Tyr Lys Ile Arg Pro Arg Gly Al - #a Met His Gly Trp Thr     # 80     - Pro Leu Thr Val Glu Lys Gly Ala Asn Val Gl - #u Lys Val Ile Leu Ala     #                 95     - Asp Thr Met Thr His Leu Asn Gly Ile Thr Va - #l Asn Thr Gly Gly Pro     #           110     - Val Ala Thr Val Thr Ala Gly Ala Gly Ala Se - #r Ile Glu Ala Ile Val     #       125     - Thr Glu Leu Gln Lys His Asp Leu Gly Trp Al - #a Asn Leu Pro Ala Pro     #   140     - Gly Val Leu Ser Ile Gly Gly Ala Leu Ala Va - #l Asn Ala His Gly Ala     145                 1 - #50                 1 - #55                 1 -     #60     - Ala Leu Pro Ala Val Gly Gln Thr Thr Leu Pr - #o Gly His Thr Tyr Gly     #               175     - Ser Leu Ser Asn Leu Val Thr Glu Leu Thr Al - #a Val Val Trp Asn Gly     #       
    190     - Thr Thr Tyr Ala Leu Glu Thr Tyr Gln Arg As - #n Asp Pro Arg Ile Thr     #       205     - Pro Leu Leu Thr Asn Leu Gly Arg Cys Phe Le - #u Thr Ser Val Thr Met     #   220     - Gln Ala Gly Pro Asn Phe Arg Gln Arg Cys Gl - #n Ser Tyr Thr Asp Ile     225                 2 - #30                 2 - #35                 2 -     #40     - Pro Trp Arg Glu Leu Phe Ala Pro Lys Gly Al - #a Asp Gly Arg Thr Phe     #               255     - Glu Lys Phe Val Ala Glu Ser Gly Gly Ala Gl - #u Ala Ile Trp Tyr Pro     #           270     - Phe Thr Glu Lys Pro Trp Met Lys Val Trp Th - #r Val Ser Pro Thr Lys     #       285     - Pro Asp Ser Ser Asn Glu Val Gly Ser Leu Gl - #y Ser Ala Gly Ser Leu     #   300     - Val Gly Lys Pro Pro Gln Ala Arg Glu Val Se - #r Gly Pro Tyr Asn Tyr     305                 3 - #10                 3 - #15                 3 -     #20     - Ile Phe Ser Asp Asn Leu Pro Glu Pro Ile Th - #r Asp Met Ile Gly Ala     #               335     - Ile Asn Ala Gly Asn Pro Gly Ile Ala Pro Le - #u Phe Gly Pro Ala Met     #           350     - Tyr Glu Ile Thr Lys Leu Gly Leu Ala Ala Th - #r Asn Ala Asn Asp Ile     #       365     - Trp Gly Trp Ser Lys Asp Val Gln Phe Tyr Il - #e Lys Ala Thr Thr Leu     #   380     - Arg Leu Thr Glu Gly Gly Gly Ala Val Val Th - #r Ser Arg Ala Asn Ile     385                 3 - #90                 3 - #95                 4 -     #00     - Ala Thr Val Ile Asn Asp Phe Thr Glu Trp Ph - #e His Glu Arg Ile Glu     #               415     - Phe Tyr Arg Ala Lys Gly Glu Phe Pro Leu As - #n Gly Pro Val Glu Ile     #           430     - Arg Cys Cys Gly Leu Asp Gln Ala Ala Asp Va - #l Lys Val Pro Ser Val     #       445     - Gly Pro Pro Thr Ile Ser Ala Thr Arg Pro Ar - #g Pro Asp His Pro Asp     #   460     - Trp Asp Val Ala Ile Trp Leu Asn Val Leu Gl - #y Val Pro Gly Thr Pro     465                 4 - #70                 4 - #75                 4 -     #80     - Gly Met Phe Glu Phe Tyr Arg Glu Met Glu Gl - #n Trp Met Arg Ser His     # 
              495     - Tyr Asn Asn Asp Asp Ala Thr Phe Arg Pro Gl - #u Trp Ser Lys Gly Trp     #           510     - Ala Phe Gly Pro Asp Pro Tyr Thr Asp Asn As - #p Ile Val Thr Asn Lys     #       525     - Met Arg Ala Thr Tyr Ile Glu Gly Val Pro Th - #r Thr Glu Asn Trp Asp     #   540     - Thr Ala Arg Ala Arg Tyr Asn Gln Ile Asp Pr - #o His Arg Val Phe Thr     545                 5 - #50                 5 - #55                 5 -     #60     - Asn Gly Phe Met Asp Lys Leu Leu Pro                     565     - (2) INFORMATION FOR SEQ ID NO: 30:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 36 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #30:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #       36         GCCC GGTGGCGCCG CTTCCG     - (2) INFORMATION FOR SEQ ID NO: 31:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 25 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #31:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #               25 GGTG ACGAT     - (2) INFORMATION FOR SEQ ID NO: 32:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 39 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #32:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #    39            AAAC ATCGATGACC ATGATTACG     - (2) INFORMATION FOR SEQ ID NO: 33:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 25 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     #33:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #               25 GGTG ACGAT     - (2) INFORMATION FOR SEQ ID NO: 34:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 18 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear   
  #34:  (xi) SEQUENCE DESCRIPTION: SEQ ID NO:     #  18              TG     __________________________________________________________________________ 
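
The coordinates printed in the listing can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original disclosure: it assumes that the "25..1731" entry for SEQ ID NO: 28 denotes the first and last nucleotide of the coding region, and it uses a hypothetical partial codon table (standard genetic code) covering only the first nine codons copied from the listing. The helper names `codon_count` and `translate` are illustrative.

```python
# Coding-region coordinates as printed for SEQ ID NO: 28 ("25..1731");
# treating them as inclusive start/end positions is an assumption.
CDS_START, CDS_END = 25, 1731

# Length stated for the protein of SEQ ID NO: 29.
PROTEIN_LENGTH = 569

# First nine codons of the coding region and the first nine residues
# of SEQ ID NO: 29, both copied from the listing.
FIRST_CODONS = "ATG GTT ATG CAC CAT GGG CAT GCC TCG".split()
EXPECTED_START = "Met Val Met His His Gly His Ala Ser".split()

# Partial standard genetic code: only the codons needed for this check.
CODON_TABLE = {
    "ATG": "Met", "GTT": "Val", "CAC": "His", "CAT": "His",
    "GGG": "Gly", "GCC": "Ala", "TCG": "Ser",
}


def codon_count(start, end):
    """Number of whole codons in the inclusive nucleotide range start..end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


def translate(codons):
    """Translate a list of codons with the partial table above."""
    return [CODON_TABLE[codon] for codon in codons]


if __name__ == "__main__":
    print(codon_count(CDS_START, CDS_END) == PROTEIN_LENGTH)
    print(translate(FIRST_CODONS) == EXPECTED_START)
```

Under these assumptions the range 25..1731 spans 1707 nucleotides, i.e. 569 codons, which agrees with the 569 amino acids stated for SEQ ID NO: 29, and the first nine codons translate to the N-terminal residues given there.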

I claim:
 1. A purified peptide cholesterol oxidase, comprising the amino acid sequence shown in SEQ ID NO:2, wherein said peptide does not include a B. sterolicum signal sequence.
 2. Recombinant cholesterol oxidase comprising an N-terminal sequence selected from the group consisting of the sequences shown in SEQ ID NO 7, 9, 11, 13, 15 and 17.
 3. Recombinant cholesterol oxidase of claim 2 comprising a sequence selected from the group consisting of the sequences shown in SEQ ID NO 19, 21, 23, 25, 27 or 29.
 4. A DNA molecule, which codes for a peptide with cholesterol oxidase activity or a sequence which is complementary thereto and which is selected from the group consisting of: a) the DNA sequence shown in SEQ ID NO 1 or a DNA sequence which is complementary thereto, b) DNA sequences which hybridize with the DNA sequence shown in SEQ ID NO 1, and c) DNA sequences which code for a peptide with the same amino acid sequence as the amino acid sequences coded by the DNA sequences of a) and b), wherein said peptide is obtainable from B. sterolicum and can be expressed in an enzymatically active form in E. coli and wherein said DNA does not encode a B. sterolicum signal sequence.
 5. The DNA of claim 4, comprising a 5' sequence selected from the group consisting of the sequences shown in SEQ ID NO 6, 8, 10, 12, 14 and 16.
 6. The DNA of claim 4, comprising a sequence selected from the group consisting of the sequences shown in SEQ ID NO 18, 20, 22, 24, 26 and 28.
 7. The DNA of claim 4, comprising the sequence shown in SEQ ID NO 1.
 8. A process for the production of a recombinant cholesterol oxidase comprising: a) transforming a host cell with an expression vector comprising the DNA of claim 4, b) culturing the transformed host cells, and c) isolating the cholesterol oxidase formed from the cytoplasm of the transformed cells.
 9. The process of claim 8, wherein the DNA comprises a 5' sequence selected from the group consisting of the sequences shown in SEQ ID NO 6, 8, 10, 12, 14 and 16.
 10. The process of claim 8, wherein the DNA comprises a sequence selected from the group consisting of the sequences shown in SEQ ID NO 18, 20, 22, 24, 26, and 28.
 11. A method for the determination of cholesterol comprising combining the recombinant cholesterol oxidase according to claim 3 with a cholesterol-containing sample under conditions suitable for the oxidation of cholesterol to cholesten-3-one and H₂O₂, and determining cholesterol based on the presence of cholesten-3-one and H₂O₂.
 12. A purified peptide cholesterol oxidase, consisting of the amino acid sequence shown in SEQ ID NO:2. 